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FIELD OF THE INVENTION 

The present invention provides integrase interacting proteins and more particularly 
cofactors which promote strand transfer activity of viral integrase, more particularly HIV 
integrase, and methods and uses relating thereto. The present invention relates to a 
cellular protein that associates with integrase (integrase interacting protein - Ihip), to 
molecules interacting with Mp and their use as an antiviral. The present invention also 
relates to antibodies, RNA interference, antigene ther^y, gene silencing or antisense 
inhibition of said integrase interacting protein. The novel integrase interaction protein is a 
target for HTV replication prevention or inhibition. 

More particular, the present invention identifies flie cellular protein, Inip76, found to be 
identical to LEDGF/DFS70/p75 as a protein that interacts with HTV-integrase. 
Coexpression of lhip76 specifically enhances, and depletion of Imp76 decreases retroviral 
genome integration into cellular chromosomal DNA. These results implicate Inip76 as a 
specific coactivator of retroviral integrase. 

BACKGROUND OF THE INVENTION 

Establishment of the provirus, a DNA copy of the viral genome integrated into the host 
cell chromosome is an obligatory step in retroviral replication. Stable integration into the 
human genome is one of the primary reasons for the persistence of the human 
immunodeficiency virus (HIV) infection, which leads to the acquired immunodeficiency 
sjTOirome (ADDS). Therefore, fflV integrase (IN), the enzyme orchestrating the insertion 
of the DNA replica of the viral genome into the cellular chromosomal DNA, is a potential 
target for antiretroviral therapy. The recent discovery of the new lead compounds able to 
protect cells against HIV infection by inhibiting integration raises hope that integrase 
inhibitors will be added to future combination cocktails (for reviews see Debyser et al., 
2002; Miller and Hazuda, 2001; Pani and Marongiu, 2000; Pommier et al., 2000). 
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Mechanistically and structurally, retroviral integrases (reviewed in Asante-Appiah and 
Skalka, 1997; Brown et al., 1987; Craigie, 2001; Hindmarsh and Leis, 1999) are similar to 
the' well-studied prokaryotic Mu and Tn5 transposases and belong to a family of DNA 
strand transferases that catalyze DNA strand cleavage/ligation via direct trans- 
5 esterification. In the course of viral integration, HIV IN performs two enzymatic 
reactions: first is the removal of the 3'- GT dinucleotides firom the long terminal repeats 
(LTRs) (i.e. the 3 '-processing reaction). The second reaction is the insertion of the 
recessed 3'- viral DNA ends into the opposite strands of the target DNA wherein the 3' 
hydroxyls of the processed LTR ends attack two phosphodiester bonds in the target DNA 

10 molecule (tiie strand-transfer reaction). Insertion of the two viral LTRs takes place in a 
coordinated fashion across the major groove of the target DNA. As a result, the integrated 
provirus is flanked by two 5-nucleotide gaps as well as two mismatched 5 '-AC 
dinucleotides, which are probably repaired by cellular enzymes (Yoder and Bushman, 
2000). HIV IN, like other retroviral integrases, is comprised of three structural domains. 

15 The amino terminal domaiu contains the HHCC zinc finger motif, coordinates Zn and 
appears to be important for formation of the active IN multimers. The central, catalytic or 
"core" domain (residues 50-212) contains the active site (i.e. the DDE motif composed of 
D64, D116 and E152) and is structurally very similar to the catalytic domains of Mu 
phage and Tn5 transposases. In the context of the fiiU-length enzyme, tiie core domain is 

20 thought to make sequence-specific contacts with flie DNA, The carboxy terminal 
fiiagment possesses unq)ecific DNA-binding activity and is critical for multimerization of 
the enzyme. All single domain firagments of IN are dimeric when produced separately; the 
full-length HIV IN as well as other retroviral integrases form multimers under the 
conditions of the in vitro enzymatic assays (Ellison et al., 1995; Engelman et al., 1993). 

25 Recombinant HIV and avian sarcoma virus (ASV) integrases have been shown to behave 
as mixtures of monomers, dimers and tetramers (Coleman et al., 1999; Deprez et al., 
2000; Jenkins et al., 1996). The presence of octamers and larger complexes has been 
suggested by some studies (Lee et al., 1997; Leh et al., 2000). Virion-associated HIV IN 
was shown to be in an oligomeric form, whereby dimers and higher order multimers 

30 appeared to be stabilized by disulfides, although flie complexes were not studied under 
native conditions (Petit et al., 1999). Earlier, IN derived firom avian myoblastosis virus 
(AMV) particles was demonstrated to be multimeric, behaving at least as a dimer 
(Grandgenett et al., 1978). 
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The 3 '-processing and strand-transfer reactions can be reproduced in vitro using 
recombinant enzyme preparations and DNA substrates that roimic LTR ends. In vivo 
retroviral integration is preceded by the assembly of a stable and compact preintegration 
complex (PIC), that contains a 9.5 kb linear DNA copy of the viral genome associated 
5 with viral and cellular proteins (Brown et al., 1987; Bukrinsky et al., 1993; Ellison et al., 
1990; Famet and Haseltine, 1991; Miller et al., 1997). Several cellular proteins have been 
suggested to play auxiliary roles during retroviral integration. Thus, the barrier-to- 
autointegration factor (BAF) has been reported to protect Moloney murine leukemia virus 
(MoMLV) PICs against suicidal self-integration (Lee and Craigie, 1998). Another cellular 

10 protein, HMG-I(Y) was found in HIV PICs and £^peared to be essential for their 
integration activity in vitro (Famet and Bushman, 1997; Miller et al., 1997). Conversely, 
BAF could substitute for HMG-I(Y) at least in vitro, partially restoring integration 
activity of salt-denatured HIV-1 PICs (Chen and Engehnan, 1998). Yet, BAF remains to 
be shown to co-fractionate with retroviral PICs. Both BAF and HMG-ICY) are small 

15 DNA-binding proteins able to bridge and deform DNA molecules (Reeves and 
Beckerbauer, 2001; Zheng et al., 2000). They are thought to play stmctural roles within 
retroviral PICs, possibly jxxxtaposing both LTRs. Similarly, Mu phage transposase as well 
as enzymatically-unrelated X phage integrase require the DNA-bending host proteins IHF 
and/or HU to form committed synaptic complexes (Friedman, 1992; Mizuuchi, 1992). 

20 IHF and HU appear to substitute for each other in promoting formation of the active X 
integrase-DNA complexes in vitro (Segall et al., 1994). Another potential co-factor for 
HIV integration, the integrase interactor 1 (Inil or hSNF5/BAF47), was originally 
discovered in a yeast two-hybrid screen for human proteins interacting with HTV-l IN 
(Kalpana et al., 1994). Inil is a subunit of the 2 MDa SWI/SNF chromatin-remodeling 

25 complex (Wang et al., 1996). It has been proposed that Inil plays a role during retroviral 
replication by directing the PICs to open chromatin regions (Kalpana et al., 1994) or by 
modulating expression of the integrated provirus (TureUi et al., 2001). Recent studies 
demonstrated that GFP-tagged Inil was exported from the nuclei of infected cells and co- 
localized with incoming sub-yiral particles (TureUi et al., 2001). Inil has also been 

30 reported to enhance the release of infectious HIV particles; detectable amoimts of Inil 
have been shown enc^sulated within virions (Yung et al., 2001). 
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In order to develop effective therapies for the treatment and prevention of retroviral virus 
infectious diseases, such as AIDS, elucidation of the mechanisms by which it replicates in 
the human host is essential. Therefore, it is apparent that there still exists a need in the art 
for the identification and characterization of the cofactors which participate in the 
5 retroviral integration process. 

The cloning and characterization of the protein Lens Epithelial Cell derived Growth 
Factor (LEDGF) is described in PCT/US98/0980L The patent application describes the 
stimulating activity of LEDGF on the proliferation of different cell types, particularly 

10 those of epithelial character, such as lens epithelial cells. In general, methods for treating 
cancer, inducing wound healing, inducing or inhibiting cell-death, cell differentiation and 
cell proliferation are provided. Also antibodies against LEDGF are described. 
The LEDGF protein has been described as the positive transcription cofactor PC4- 
interactmg protem (Ge et al., 1998). Two alternatively spUced cDNA clones were isolated 

15 fi-om a HeLa library coding for two proteins p52 (333 amino acids, as predicted fi-om the 
cDNA) and p75 (530 aminoacids) sharing identical 325 amino terminal residues. The 
transcripts coding for p75 and p52 were detected in different cell types and tissues, p52 
being most abxmdant in testis and p75 in thymus. Ihdependentiy, a cDNA clone coding 
for a protein identical to p75 has beeti isolated from a lens epithelium cell library (Singh 

20 et al., 1999). Overexpression of the protein stimulated survival of diverse primary cells 
and cell lines and enhanced their resistance to oxidative and hyperthermic stress (hence 
LEDGF, "lens epithelium derived growth factor"). The same protein has also been 
identified as the DFS70 autoantigen, antibodies to which were found in some cases of 
atopic dermatitis, asflmia and interstitial cystitis (Ochs et al., 2000). One case of acute 

25 myeloid leukemia with a chromosomal translocation resulting in a fiision of the NUP98 
and LEDGF genes has been described (Ahuja et al., 2000). 



SUMMARY OF THE INVENTION 

30 

The present invention demonstrates that in the nuclear extracts of human cells stably 
expressing the HIV integrase protein firom a synthetic gene, IN exists as part of a large 
distinct complex with apparent Stokes radius of 61 A, which dissociates upon dilution 
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yielding a core molectile of 41 A. The present invention furthermore provides that the 
41 A core is a tetramer of IN, whereas 61 A molecules are composed of IN tetramers 
associated with a cellular protein with an apparent molecular weight of 76 kDa. In the 
present invention, this integrase interacting protein (Inip76) has been identified as being 
5 identical to LEDGF/DFS70/p75, a protein implicated in regulation of gene expression and 
cellular stress-response. HIV-1 IN and Imp76 co-localized in the nuclei of human cells 
stably expressing IN. Furfhemiore, the present invention demonstrates that recombinant 
Inip76 strongly promotes strand-transfer activity of HIV-1 IN in vitro. The present 
invention thus provides that the minimal IN molecule in huntian cells is a tetramer and that 
10 Inip76 plays an important role in retroviral integration. 

Therefore the present invention relates to proteins of the hepatoma-derived groAvth factor 
family, more in particular Inip76, the polynucleotides expressing them and molecules 
binding or interacting with the proteins or the polynucleotides. The present invention 

15 relates to isolated and purified polynucleotide encoding the Inip76 protein, the complex of 
Iaip76 with the integrase aad their use. The present invention also relates to the Inip76 
protein and its use. Furthermore, the present invention relates to molecules which 
comprise a region specifically binding to the Inip76 protein of the hepatoma-derived 
growth factor family or nucleic acids encoding said proteins and which suppress or 

20 prevent retroviral replication and to their use. The present invention also relates to 
antibodies to Imp76 and to antisense oligonucleotides, antigene therapy, RNA 
interference and ribozymes for inhibiting the expression of Iiiip76 and tiieir use. 
The present invention also relates to a protein complex comprising a retroviral integrase 
and Inip76, each of them natural or recombinant. The invention also relates to a method 

25 of stimulating retroviral integration using Inip76 or functional analogues thereof and to 
the use of Inip76 as a cellular cofactor of retroviral integration. 

Furthermore, the present invention relates to the HlV-integrase, characterised in that said 
integrase is in its tetrameric form and to its use, 

30 The present invention relates to the use of an isolated and purified polynucleotide 
encoding the Inip76 protein or an intermediate or a firagment of said protein, an allelic 
variant, a homologue, a portion or a mutation thereof for the prevention, treatment or 
diagnosis of viral infections, more in particular a retroviral infection, yet more 
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particularly a Irativiral infection and still more in particular a HIV infection. In a 
particular embodiment, the present invention also relates to the use of said 
polynucleotides for the manufacture of a medicament for the prevention and/or treatment 
of viral infections or for the manufacture of a diagnostic tool in the viral field, more 
5 particularly the HIV-field and to the use of said polynucleotides for the screening of 
molecules for their anti-viral activity. Another embodiment of the present inventin relates 
to a polynucleotide ^coding the Inip76 protein or an intermediate or a firagment of said 
protein, an allelic variant, a homologue, a portion or a mutation thereof, further 
comprising a polynucleotide which codes for a least a portion of the HIV integrase, more 
10 in particular the combination of both polynucleotides being arranged in such a way that a 
fusion protein results after expression. 

The present invention also relates to vectors comprising at least a polyaucleotide 
encoding the Inip76 protein or an intermediate or a firagment of said protein, an allelic 
variant, a homologue, a portion or a mutation thereof and to the use of said vector for the 
15 manufacture of a medicament for the prevention and/or treatment of viral infections or for 
the manufacture of a diagnostic tool in the viral field, more particularly the HIV-field and 
to the use of said vector for the screening of molecules for their anti-viral activity. 

The present invention furthermore relates to the use of the proteins of the hepatoma- 
20 derived growth factor family, more in particular the Inip76 protein, a homologue, a 
variant, a mutated form or a firagment thereof, for the manufacture of a medicament for 
the prevention and/or treatment of a viral infection, more in particular a lentiviral 
infection, yet more particularly a HIV-infection. In a particular embodiment the present 
invention relates to the use of said Inip76 protein, a homologue, a variant, a mutated form 
25 or a firagment thereof, for the manufacture of an HIV-related diagnostic tool, for the 
screening of molecules for their, anti-viral activity or for the crystalUsation of a lentiviral- 
integrase. 

The present invention furthermore relates to the use of a molecule which comprises a 
30 region specifically binding to a protein of the hepatoma-derived growth factor family, 
more particularly Inip76 or nucleic acids encoding said protein of the hepatoma-d^ved 
growth factor family, for the manufacture of a medicament for the prevention and/or 
treatment of a viral infection. In a particular embodiment, said molecules are 
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characterised in that they prevent or suppress the protein of the hepatoma-derived growth 
factor family of interacting with retroviral integrase. In particular embodiments, these 
molecules can be selected from the group comprising an antibody or any fragment 
thereof; a small molecule specifically binding to Inip76 or to nucleic acids encoding said 
5 Inip76; a ribozyme capable of cleaving nucleic acids encoding Ihip76; antigene 
compounds and anti-sense nucleic acids or analogues thereof, capable of inhibiting the 
expression of Imp76 by interacting with nucleic acids encoding Ihip76. It is tiierefore an 
object of the present invention to provide antibodies to the Integrase-interacting protein 
(INIP76) and its subunits, and methods for their preparation, including recombinant 

10 means. In another particular embodiment the present invention relates to small interfering 
RNAs and vectors encoding said siRNAs specific for Inip76 and to their use for the 
manufacture of a medicament for the prevention and/or treatment of viral infections.The 
present invention thus relates in general to the use of RNA interference directed against 
Inip76 for the prevention and treatment of lentiviral infections. In another particular 

15 embodiment, the present invention relates to the use of small molecules capable of 
modulating the interaction of a protein of the hepatoma-derived growtii factor family with 
integrase, for the manufacture of a medicament for the prevention and/or treatment of a 
viral infection. 

20 It is an object of the present invention to provide a method and associated assay system 
for screening substances such as drags, agents and the like, for their activity in combating 
the adverse effects of the Integrase-interacting proteins (EsflPs) and/or its subimits in 
retrovirus infected mammals. The present invention also relates to a method to identify 
molecules which comprise a region specifically binding to a protein of the hepatoma- 

25 derived growth factor family, more in particular Imp76 or nucleic acids encoding said 
protein of the hepatoma-derived growth factor family, comprising the steps of exposing 
said protein or nucleic acids encoding said protein to at least one molecule whose abiUty 
to suppress said protein of interacting with a retroviral integrase protein is sought to be 
determined, followed by deteimining the interaction (i.e. binding, hybriddisation) of said 

30 molecule(s) to said Inip76 protein, to the binding places on teh integrase of said Inip76 
protein or to nucleic acids encoding said protein and finally monitoring the prevention or 
suppression of retroviral replication or integration by at least one of said molecules. A 
particular embodiment of the present invention relates to the above described method 
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where the abiHty of the molecules to suppress the promoting activity on the strand 
transfer activity of HIV is determined. 

On aspect of present inventions is to improve integration of a transgene in a mammalian 
5 host cell genome by a method to introduce integrase-interacting proteins (INIPs) (gene 
therapy), preferably Inip76, or an active fiagment thereof to increase the amount of 
integrase-interacting proteins (INIPs), preferably hrip76, or an active firagment thereof in 
said mammalian cell by methods of transfection known to the man skilled in the art (e.g* 
liposome deUvery). In another embodiment, this method may be combined by with 
10 transfecting host cell with heterologous or exogenous DNA encoding integrase or an 
active fragment thereof. 

In a further embodiment, the present invention relates to methods of isolating and 
characterizing a novel Integrase-interacting protein (DSflP). 

15 

In a particular embodiment, the present inveation relates to integrase-interacting proteins, 
active fragments thereof, agonists thereof, mimics thereof, and combinations thereof, said 
protein having the following characteristics: 

a) it binds with the retroviral Integrase and stimulates the activity of retrovhal 
20 Integrase and/or stimulates the DNA binding activity of retroviral Integrase and/or 

stimidates the strand-transfer activity of retroviral Integrase 

b) it possesses a molecular weight between 66 kDa to 86 kDa or an apparent 
molecular weight of approximately 76 kDa as determined by SDS-polyacrylamide gel 
electrophoresis. 

25 Accordingly, it is an object of the present invention to provide an Integrase-interacting 
protein (INIP76), which possesses an apparent molecular weight of approximately 76 kDa 
as determined by SDS-polyacrylamide gel electrophoresis and which exhibits certain 
characteristics and activities associated with the interaction of the Integrase-interacting 
protein (INIP) with integrase and their combined retroviral integration activity. 

30 It is another object of the present invention to provide a method for the prevention and/or 
treatment of mammals from a retorviral infection to control the amount or activity of fee 
Integrase-interacting proteins (INIPs) or subunits feereof. 

8 
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The present invention also relates to a composition comprising a) molecule which 
comprises a region specifically interacting with a protein of the hepatoma-derived growth 
factor family or nucleic acids encoding said protein of the hepatoma-derived growth 
factor family, and b) one or more compounds ejQfective in the treatment or prevention of 
5 viral infections. 

It is an object of the present invention to provide a method for the treatment of mammals 
to control the amount or activity of the Ihtegrase-interacting proteins (INIPs) or subimits 
thereof, so as to treat or avert the adverse consequences of invasive, spontaneous or 

1 0 idiopathic pathological states resultant from HIV infection. 

It is a still further object of the present invention to provide pharmaceutical compositions 
for use in therapeutic methods which comprise or are based upon the Ihtegrase-interacting 
protein (I1SIIP76), its subunits, their binding partner(s), or upon agents or drugs that 
control the production, or that antagonize the activities of the Integrase-interacting protein 

15 (INIP76). 

Other objects and advantages will become apparent to those skilled in the art from a 
review of the description which proceeds with reference to the following illustrative 
drawings. 

20 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. Extraction of IN from nuclei of 293T-IN^ala cells. 

25 (A) 293T-riM^ala cells were lysed in lOOCSK buffer in the presence of 4 iig/m\ digitonin 
or 0.5% NP40 on ice for 10 minutes. The supernatant (S) and nuclear pellet (P) fractions 
were recovered and analyzed by western blotting with an anti-IN antibody. The first lane 
contains the total cell extract. 

(B) 293T-IN'ala cells were lysed in lOOCSK buffer supplemented with 0.5% NP40 on ice 
30 for 10 minutes, and the extracted nuclei were re-suspended in CSK buffer containing 0 - 
500 mM NaCl. After centrifugation, supematants (S) and nuclear pellets (P) were 
analyzed by western blotting with an anti-IN antibody. The total nuclear protein was 
loaded in the first lane (nuc). 

9 
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(C) NP40-permeabilized nuclei from the 293T-IN^ala cells, prepared as above, were 
incubated in lOOCSK buffer with (+DNaseI) or without (-DNasel) DNAsel 
(250 units/ml) at 25 ""C for 10 (10') or 30 (30') minutes. Following centrifugation, the 
supCTnatants (S) and pellets (P) were separated in an 11% SDS PAGE gel, the upper part 

5 of which was used for the immunoblot to detect MCM3 (91 kDa) and the lower part to 
detect IN (32 kDa). The total cytoplasmic and nuclear protein fractions were loaded in the 
first (cyt) and the second (nuc) lanes, respectively. 

(D) NP40-penneabilized nuclei from 293T-IN*ala cells were incubated in lOOCSK buffer 
with or without DNasel (250 units/ml) for 10 minutes, pelleted by centrifugation and 

10 resuspended in ice-cold hypotonic buffer (2mM EDTA, 2mM Hepes [pH7.5]). After 
centrifiigation, supematants (S) and pellets (?) were analyzed by western blotting with 
anti-IN antibodies. The first lane (nuc) contains total nuclear protein. 

Figure 2. Cross-linking of the IN and BVf complexes with DTSSP. 

15 (A) The nuclear extract from 293T-IN^ala cells was prepared in tiie 400CSK buffer, was 
cross-linked with DTSSP and separated in a non-reducing 4-12% SDS PAGE gel. The 
IN-containing cross-linking adducts were detected by western blotting with a polyclonal 
anti-IN antibody. Prior to cross-linking, the extract was adjusted to 100 jig/ml (lanes 1-5), 
20 iig/ml (lanes 6-9) or 4 jxg/ml (lanes 10-13) of total protein. The concentration of 

20 DTSSP was 0.1 mM (lanes 3, 7, 11), 0.5 mM (lanes 1, 4, 8, 12) or 2.0 mM (lanes 5, 9, 
13). No cross-linker was added to the samples in lanes 2, 6 and 10. The sample in lane 1 
was cross-linked in the presence of 0.2% SDS. 

(B) Cross-linking of the FLAG-tagged IN complexes was done in similar conditions as 
in (A). Only lanes containing samples cross-linked with 2 mM DTSSP are shown; the 
25 sample in lane 1 was cross-linked in the presence of 0.2% SDS. The pSOO""^, pi 50^^, p60''\ 
the IN monomer bands and the positions of the MW markers are indicated. The «470 kDa 
mark corresponds to the band of the catalytic subunit of DNA PK (MW 469 kDa), which 
was detected in a separate lane with a monoclonal anti-DNA PKcs antibody. 

30 Figure 3. Determination of sizes and molecular weights of the IN complexes. 

(A) Chromatography of nuclear extracts from 293T-IN^ala cells was carried out on a 
calibrated Superdex 200 column. Prior to chromatography, the extract was adjusted to 
550 iig/ml or 30 |ig/ml of total protein; the collected fractions (1-26) were tested for the 
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presence of IN by western blotting. The elution volumes (Ve) and the respective partition 
coefficients (ICav) for the observed IN peaks are indicated. 

(B, C) Determination of the Stokes radii and approximate molecidar wei^ts of the IN 
complexes from the experimental Kav values. The partition coefficients for the standard 
5 proteins were determined in the same conditions (thyr., thyroglobulin, Kav = 0.039; 
ferritin, Kav = 0.17; catalase, Kav = 0.26; aldolase, Kav = 0.28; BSA, bovine serum 
albumin, Kav = 0.39; chymotr., chymotrypsinogen A, Kav = 0.59). 

Figure 4. Immunoprecipitation of the FLAG-tagged IN from diluted nuclear 
10 extracts. 

(A) Nuclear extracts prepared from metabolically labeled 293T-IN*ala and 293T- 
IN^alaFLAG cells were diluted to 200 |ag/ml of total protein and immunoprecipitated with 
the anti-FLAG M2 antibody and protein G agarose beads for 16 hours at 4 The protein 
was eluted by boiling in SDS PAGE sample buffer and separated in a 4-20% SDS PAGE 

15 gel. Radioautograph of the gel is presented. 

(B) FLAG-tagged IN was immunoprecipitated from a diluted nuclear extract of non- 
labeled 293T-IN^alaFLAG cells overnight The protein was eluted with FLAG peptide in 
400CSK buffer and cross-linked with DTSSP. The reaction conditions are similar to those 
in Figure 2. 

20 (C) The protein immunoprecipitated and eluted as in (B) was subjected to gel filtration on 
a Superdex 200 column. The fractions collected (1-26) were analyzed by western blotting 
with anti-IN antibodies. The Rs value corresponding to the observed peak was determined 
as in Figure 3B. 

25 Figure 5. Co-immunoprecipitation of FLAG-tagged IN with Inip76 from non-diluted 
nuclear extracts. 

(A) Nuclear extracts from 293T-IN'alaFLAG and 293T cells (700 fig/ml of total protein) 
were incubated with anti-FLAG M2 antibody and protein G agarose beads for 4 hours. 
The beads were washed as described in materials and methods and bound proteins were 
30 eluted with FLAG peptide in 400CSK buffer. The eluted proteins were concentrated by 
precipitation with TCA, re-dissolved in SDS sample buffer, separated m 4-20% 
denaturing PAGE gels and visualized by silver staining. Lane 1: immunoprecipitate of a 
nuclear extract from 60x10^ 293T-IN^alaFLAG cells. Lanes 2 and 3: 
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inunnnoprecipitation was done in parallel with nuclear extracts from 293T and 293T- 
IN^alaFLAG cells. The bands of INf, Inip76, the heavy and the light chains of the anti- 
FLAG M2 IgGl antibody as well as aprotinin (protease inhibitor presCTit in 400CSK 
buffer) are indicated. Two sets of molecular weight markers were used in both gels to 
5 detemune the apparent molecular weight of lDip76. Positions of the molecular weight 
markers are shown. 

(B) Nuclear extract from 20 xlO^ 293T-IN^alaFLAG cells («700 |Lig/ml total protein) was 
immunoprecipitated for either 4.5 hours (left lane) or 18 hours (right lane). The protein 
was then eluted and analyzed as in (A). 

10 

Figure 6. Iiiip76 is identical to LEDGF/DFS70/p75, appears to be a component of the 
61 A complex and stimulates IN activity in vitro. 

(A) The Coomassie blue-stained PVDF membrane used for amino-terminal 
microsequencing of Inip76. The lNf-Inip76 complex was eluted from protein G agarose- 

15 immobilized anti-FLAG M2 antibody with FLAG peptide, separated in reducing 4-20% 
SDS PAGE gel and transferred onto the PVDF membrane (lane 1). The proteins left on 
the beads after incubation with FLAG peptide were eluted with SDS sample buffer (lane 
2). The bands corresponding to Inip76, INf, the heavy (H) and light (L) chains of the anti- 
FLAG M2 antibody, aprotinin and the MW markers are indicated. 

20 (B) Deconvoluted and sequence-converted MS/MS spectrum of a doubly charged peptide 
ion (jniz 982.57) obtained from the in-gel tryptic digest of Inip76 corresponding to the 
LEDGF peptide N425-K442. The observed b- and y-dominant fragment ions are 
indicated (Biemann, 1990). 

(C) Co-iromunoprecipitation of LEDGF and FLAG-tagged IN from a nuclear extract of 
25 293T-IN^alaFLAG cells. Immunoprecipitation was carried with anti-FLAG (lanes 2 and 

2'), anti-LEDGF (lanes 3 and 3') or no antibody (lanes 1 and V). After 4 hours 
incubation, protein G agarose beads with precipitated protein complexes were washed 
with three changes of 400CSK buffer and re-suspended in reducing SDS PAGE sample 
buffer, followed by western blotting to detect INf and Inip76/LEDGF (lanes I'-S'). The 
30 lanes 1 -3 contain the immunoprecipitation supematants. 

(D) Elution of the 61 A IN complex from a gel filtration column is shifted after pre- 
incubation with anti-LEDGF antibody. A nuclear extract of 293T-IN^alaFLAG cells was 
preincubated with 3 fig/ml anti-HA (control IgGl) or anti-LEDGF antibody and separated 

12 
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by chromatography on a Superdex 200 column. INf was detected in tibe Jfractions by 
western blotting. The void volume of the column was 8.3 ml, approximately 
corresponding to fraction 2. 

(E) Cross-linking of the Inip76-INf complex with DTSSP. The Inip76/INf complex was 
5 immimoprecipitated from a nuclear extract prepared from 293T-IN^alaFLAG cells with 

anti-FLAG M2 antibody and protein G agarose for 4.5 hours. The protein was eluted with 
FLAG peptide and incubated with 2mM DTSSP in the presence (lanes 1 and 1') or 
absence (lanes 2 and 2') of 0.2% SDS. The cross-linked samples were then separated in a 
non-reducittg 4-12% SDS PAGE gel and immunoblotted with polyclonal anti-IN (left 
10 blot) or monoclonal anti-LEDGF (right blot) antibodies. The positions of INf and Inip76 
as well as the cross-linking adducts plSO'^*, pSOO''^ pHMWi*"^ and pHMWa""^ are indicated. 
Anti-FLAG M2 IgGl present in the sample is detected on the anti-LEDGF westem blot. 

(F) Recombinant Inip76 enhances HIV-1 IN strand transfer activity in vitro. Mini-HTV 
DNA was pre-incubated with HIV-1 IN for 7 minutes at room temperature. Next, 

IS 0-0.8 \JiM His6-tagged Inip76 was added to the reactions that were frnther incubated at 
37°C for 90 minutes. The concentrations of IN and Imp76 used in the reactions are 
indicated. The reactions contained 125 ng mini-HIV DNA, 1 10 mM NaCl, 20 mM Hepes 
[pH 7.5], 5 mM DTT and 5 pM ZnCh in a final volume of 20 \xL The reactions were 
stopped by addition of 0.5% SDS and 25 mM EDTA and the samples were digested with 

20 0.25 mg/ml proteinase K at 37 °C for 30 minutes to completely disrapt protein-DNA 
complexes. DNA was then precipitated with ethanol, re-dissolved in Tris-EDTA and 
analyzed by electrophoresis in an 0.8% agarose gel. The positions of the DNA molecular 
weight markers (23.1, 9,4, 6.6, 4.4 and 2.3 kb) are indicated. The gel was stained with 
SybrGold (Molecular Probes). 

25 

Figure 7. Co-localization of FLAG-tagged IN and Inip76/LEDGF in 293T- 
IN^alaFLAG cells. 

(A) Confocal laser scanning micrographs of a fixed and permeabilized cell fluorescently 
stained with a combination of monoclonal anti-LEDGF plus Alexa-488 conjugated anti- 
30 mouse antibodies to detect Inip76 (green, Imp76) and rabbit polyclonal anti-FLAG plus 
Alexa-555 conjugated anti-rabbit antibodies (red, DSf) to localize FLAG-tagged IN. DNA 
was stained with To-Pro3 iodide (shown as white). The two-color merged image 
(IN+Inip76) was produced by overlaying the IN and Inip76 images. 
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(B) Both INf and Inip76 axe associated with condensed chromosomes during mitosis. 
Immunofluorescent staining was performed as described in (A). 

(C) DNA PKcs and INf display no significant co-localization. INf (red, IN) was detected 
as in (A); DNA PKcs (red, PKcs) was localized with monoclonal anti-DNA PKcs 

5 antibody plus Alexa-555 conjugated anti-mouse antibody. The two color IN+PKcs image 
is an overlay of the IN and PKcs images. 

Figure. 8. Effect of RNA interference-mediated knock-down of Iiiip76/LEDGF 
expression in 293T-IN*alaFLAG ceUs on HIV IN distribution. 

10 Inip76 (A, B) and IN (C) were detected by indirect immunofluorescoice in non- 
transfected cells (A), cells transfected with 76A siRNA (B, C). Immunofluorescence, 
chromosomal DNA staining and detection was done as described in Example VI. 

Figure 9. Purification of the Inip76-IN complex from bacteria co-expressing the two 
15 proteins. 

(A) Elution of the IN-Imp76 complex fi-om NiNTA Sepharose; 7 fi-actions collected were 
analyzed by 11 % SDS PAGE and Coomassie staining. (B) A lysate of induced 
PC2LEDGF cells lacking Hisa-tagged IN was incubated with NiNTA Sepharose and the 
bound proteins were eluted with 200 mM imidazoL The band corresponding to hiip76 is 

20 not present on this gel, confirming that fi-ee Inip76 is not associating with NiNTA 
Sepharose. (Q Further purification of the IN-Ihip76 complex by cation exchange 
chromatography. The fractions 3, 4, 5 and 6 eluted from NiNTA (see panel A). They were 
pooled and loaded onto a 1 ml HiTrap SP Sepharose colmnn. The complex was eluted 
with linear gradient of NaCl concentration. The non-boimd (flow-through) fraction as 

25 well as the fractions #10 through #28 were analyzed by 1 1 % SDS PAGE and Coomassie 
staining. Positions of the molecular weight markers (kDa) are indicated. 

Figure 10. Concentration of the enzymatically-active IN-Inip76 complex. 

(A) The complex purified by chromatography on NiNTA and SP Sepharose was 
30 concentrated using Centricon-30. Equal volumes of the original non-concentrated sample 
(lane 1), the intermediate samples (lanes 2, 3) and die final concentrated sample (lane 4) 
were separated by 11 % SDS PAGE. (B) Activity of the purified and concentrated 
recombmant IN-Ihip76 complex was tested in the mini-HIV reaction in conditions 
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described in the Example Vn. Positions of the DNA molecular weight markers (kb) 
separated in a separate lane of the gel are indicated. 



Figure 11. Recombinant LEDGF/p75 forms a stable complex with BOTV-l IN and 
5 mv.2 IN, but not with HTLV-2 IN. 

The puU-down assay was performed as described in example 12. The specific buffer 
conditions, i.e. NaCl concentration are indicated. MgCl2 was present in all reactions. 
Recombinant LEDGF/p75 was incubated with His6-tagged HIV-1 IN (lanes 3 and 5 on 
both gels), HIV-2 IN Oanes 4 and 6 on the upper gel) or HTLV-2 IN (lanes 4 and 6 on the 

10 lower gel). The complexes were recovered on a Ni^**"-chelating resin (lanes 2-6). During 
the test some of the proteins aggregated, as shown in lanes 8-12. In the control samples 
(lane 2), recombinant IN was omitted. Lanes 1 and 7 reflect protein input in the reactions; 
BSA, LEDGF/p75 en Hise-IN were loaded in the same amounts as were present in the 
binding reactions. The respective positions of the proteins are indicated on the left side of 

15 the gel; the gel was stained using Coomassie R250. 



Figure 12 

A. Stable knock-down of LEDGF/p75 in transduced MOLT-4 - derived cell lines 

Extracts of polyclonal MOLT-4-derived cell lines were separated using sodium dodecyl 
20 sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and blotted onto a PVDF 

membrane. LEDGF/p75 was detected using monoclonal antibodies. Extracts of MOLT-4 

(lane 1) and MOLT-4 transduced with the following retroviral vectors: MSCV/U6-cont 

(lane 2) and MSCV/U6-p75 ( lane 3) are shown. 

B Delay of mV-1 replication in cell lines with silenced LEDGF/p75. 
25 The MOLT-4-derived cell line MSCV/U6-cont was infected with HIV-l(niB) at different 

moi: — • — (2.10"* moi), - -t- - (2.10"^ moi), - - (2.10"^ moi). In parallel the 

MOLT-4-derived cell line MSCV/U6-p75 was infected: o •••• (2.10"^ moi), V- 

• • - (2.10'^ moi), □ (2.10"^ moi). At different time points after infection p24 

was measured in the supernatant. 

30 
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DETAILED DESCRIPTION OF THE INVENTION 

In accordance Avith the present invention there may be employed conventional molecular 
5 biology, microbiology, and recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g., Sambrook et al, "Molecular 
Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular Biology" 
Volumes I-IH [Ausubel, R. M., ed. (1994)]; "Cell Biology: A Laboratory Handbook" 
Volumes WH [J. E. Celis, ed. (1994))]; "Current Protocols in Immunology" Volumes I-IH 
10 [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M. J. Gait ed. 1984); "Nucleic 
Acid Hybridization" [B. D. Hames & S. J. Higgias eds. (1985)]; "Integration And 
Translation"[B. D. Hames & S. J. Higgins, eds. (1984)]; "Animal Cell Culture" [R, L 
Freshney, ed. (1986)]; "Imnmobilized Cells And Enzymes" [ERL Press, (1986)]; B. 
Perbal, "A Practical Guide To Molecular Clotiing" (1984). 

15 

Definitions 

If appearing herein, the following temis shall have the definitions set out below. 

20 The terms "Integration-interacting protein, "INIP" and any variants not specifically listed, 
may be used herein interchangeably, and as used throughout the present application and 
claims refer to protein material including single or multiple proteins, and extends to those 
proteins having the amino acid sequence substantially homologous to the sequence of 
LEDGF and the profile of activities of INIP76 as set forth herein and in the Claims. 

25 Accordingly, proteins displaying substantially equivalent or altered activity are likewise 
contemplated. These modifications may be deliberate, for example, such as modifications 
obtained through site-directed mutagenesis, or may be accidental, such as those obtained 
through mutations in hosts that are producers of the complex or its named subimits. Also, 
the terms "Integration-interacting protein", "BSTIP" and "IISllP(s)" are intended to include 

30 within their scope proteins specifically recited herein as well as all substantially 
homologous analogs and allelic variations. 
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A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that functions as 
an autonomous unit of DNA replication in vivo; i.e., capable of replication mder its own 
control. 

A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA 
5 segment may be attached so as to bring about the replication of the attached segment. 

A "DNA molecule" refers to the polymeric form of deoxyribonucleotides (adenine, 
guanine, thymine, or cytosine) in its either single stranded form, or a double-stranded 
helix. This term refers only to the primary and secondary structure of the molecule, and 

10 does not limit it to any particular tertiary forms. Thus, this term includes double-stranded 
DNA found, inter alia, in linear DNA molecules (e.g., restriction fragments), viruses, 
plasmids, and chromosomes. In discussing the structure of particular double-stranded 
DNA molecules, sequences may be described herein according to the normal convention 
of giving only the sequence in the 5' to 3* direction along the nontranscribed strand of 

15 DNA (i.e., the strand having a sequence homologous to the mRNA). 



A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed and 
translated into a polypeptide in vivo when placed under the control of appropriate 
regulatory sequences. The boundaries of the coding sequence are determined by a start 
20 codon at the 5' (amino) terminus and a translation stop codon at the 3' (carboxyl) 
terminxis. A coding sequence can include, but is not limited to, prokaryotic sequences, 
cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., 
mammalian) DNA, and even synthetic DNA sequences. A polyadenylation signal and 
integration temiination sequence will usually be located 3' to the coding sequence. 

25 

A "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase 
in a cell and initiating transcription of a downstream (3' direction) coding sequence. For 
purposes of defining the present invention, the promoter sequence is bounded at its 3' 
terminus by the transcription initiation site and extends upstream (5' direction) to include 
30 the minimmn number of bases or elements necessary to initiate transcription at levels 
detectable above background. Within the promoter sequence will be found a transcription 
initiation site (conveniently defined by mapping with nuclease SI), as well as protein 
binding domains (consensxxs sequences) responsible for the binding of RNA polymerase. 
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Eukaryotic promoters will often, but not always, contain "TATA" boxes and "CAT" 
boxes. Prokaryotic promoters contain Shine-Dalgamo sequences in addition to the -10 
and -35 consensus sequences. 

5 The term "oligonucleotide," as used herein in referring to the probe of the present 
invention, is defined as a molecule comprised of two or more (deoxy)ribonucleotides, 
preferably more than three. Its exact size will depend upon many factors which, in turn, 
depend upon the ultimate function and use of the oligonucleotide. A ^^polynucleotide" as 
used herein comprises at least 20 (deoxy)ribonucl60tides. 

10 

The term "primer" as used herein refers to an oligonucleotide, whether occurring naturally 
as in a purified restriction digest or produced synthetically, which is capable of acting as a 
point of initiation of synthesis when placed under conditions in which synthesis of a 
primer extension product, which is complementary to a nucleic acid strand, is induced, 

15 i.e., in the presence of nucleotides and an inducing agent such as a DNA polymerase and 
at a suitable temperature and pH. The primer may be either single-stranded or double- 
stranded and must be sufficiently long to prime the synthesis of the desired extension 
product in the presence of the inducing agent. The exact length of the primer will depend 
upon many factors, including temperature, source of primer and use of the method. For 

20 example, for diagnostic applications, depending on the complexity of the target sequence, 
the oligonucleotide primer typically contains 15-25 or more nucleotides, although it may 
contain fewer nucleotides. 

The primers herein are selected to be "substantially" complementary to different strands 
25 of a particular target DNA sequence. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. Therefore, the primer sequence 
need not reflect the exact sequence of the template. For example, a non-complementary 
nucleotide firagment may be attached to the 5* end of the primer, with the remainder of the 
primer sequence being complementary to the strand. Alternatively, non-complementary 
30 bases or longer sequences can be interspersed into the primer, provided that the primer 
sequence has sufficient complementarity with the sequence of the strand to hybridize 
therewith and thereby form the template for the synthesis of the extension product. 
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A cell has been "transformed" by exogenous or heterologous DNA when such DNA has 
been introduced inside the cell. The transforming DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes, yeast, and mammalian cells for example, the transforming DNA may be 
5 maintained on an episomal element such as a plasmid. With respect to eukaiyotic cells, a 
stably transformed cell is one in which the transforming DNA has become integrated into 
a chromosome so that it is inherited by daugihter cells through chromosome replication. 
This stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or 
clones comprised of a population of daughter ells containing the transforming DNA. A 
10 "clone" is a population of cells derived from a single cell or common ancestor by mitosis. 
A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for many 
generations. 

Two DNA sequences are "substantially homologous" when at least about 75% 
15 (particularly at least about 80%, and most particularly at least about 90 or 95%) of the 
nucleotides match over the defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by comparing the sequences using standard 
software available in sequence data banks, or in a Southem hybridization experiment 
under, for example, stringent conditions as defined for that particular system. Defining 
20 appropriate hybridization conditions is within flie skill of the art. See, e.g., Maniatis et al., 
supra; DNA Cloning, Vols. I & n, supra; Nucleic Acid Hybridization, supra. 
Two amino acid sequences are "substantially homologous" when at least about 70% of 
the amino acid residues (preferably at least about 80%, and most preferably at least about 
90 or 95%) are identical, or represent conservative substitutions. 

25 

A "heterologous" region of the DNA construct is an identifiable segment of DNA within a 
larger DNA molecule that is not foimd in association with the larger molecule in nature. 
Thus, when the heterologous region encodes a mammalian gene, the gene will usually be 
flanked by DNA that does not flank the mammalian genomic DNA in the genome of the 
30 source organism. Another example of a heterologous coding sequence is a constract 
where the coding sequence itself is not found in nature (e.g., a cDNA where the genomic 
coding sequence contains introns, or synthetic sequences having codons different than the 
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native gene). Allelic variations or naturally-occurring mutational events do not give rise 
to a heterologous region of DNA as defined herein. 



An "antibody* is any immunoglobulin, including antibodies and fiagments thereof, that 
5 binds a specific epitope. The term encompasses polyclonal, monoclonal, and chimeric 
antibodies, the last mentioned described in fiirther detail in U.S. Pat Nos. 4,816,397 and 
4,816,567. 

An "antibody combining site" is that structural portion of an antibody molecule comprised 
of heavy and light chain variable and hypervariable regions that specifically binds 
10 antigen. 

The phrase "antibody molecule" in its various grammatical forms as used herein 
contemplates botii an intact immunoglobulin molecule and an immunologically active 
portion of an immunoglobulin molecule. 

Exemplary antibody molecules are intact immimoglobulin molecules, substantially intact 
15 immunoglobulin molecules and those portions of an immunoglobulin molecule that 
contains the paratope, including those portions known in the art as Fab, Fab\ F(ab*).sub.2 
and F(v), which portions are preferred for use in the therapeutic methods described 
herein. 

Fab and F(ab')-sub.2 portions of antibody molecules are prepared by the proteolytic 
20 reaction of papain and pepsin, respectively, on substantially intact antibody molecules by 
methods that are well-known. See for example, U.S. Pat. No. 4,342,566 to Theofilopolous 
et al. Fab' antibody molecule portions are also well-known and are produced firom FCab") 
portions followed by reduction of the disulfide bonds linking the two heavy chain 
portions as with mercaptoethanol, and followed by alkylation of the resulting protein 
25 mercaptan with a reagent such as iodoacetamide. An antibody containing intact antibody 
molecules is preferred herein. 

The phrase "monoclonal antibody" in its various grammatical forms refers to an antibody 
having only one species of antibody combining site capable of immunoreacting with a 
particular antigen. A monoclonal antibody thus typically displays a single binding affinity 
30 for any antigen with which it immunoreacts. A monoclonal antibody may therefore 
contain an antibody molecule having a pluraUty of antibody combining sites, each 
immunospecific for a different antigen; e.g., abispecific (chimeric) monoclonal antibody. 

20 
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The phrase "phannaceutically acceptable" refers to molecular entities and compositions 
that are physiologically tolerable and do not typically produce an allergic or similar 
imtoward reaction, such as gastric upset, diiziness and the like, when administered to a 
himian. 

The phrase "therapeutically effective amount" is used herein to mean an amount sufficient 
to prevent, and preferably reduce by at least about 30 percent, more preferably by at least 
50 percent, most preferably by at least 90 percent, a clinically significant change in the 
activity of flie HIV infection. 

A DNA sequence is "operatively linked" to an expression control sequence when ttie 
expression control sequence controls and regulates the transcription and translation of that 
DNA sequence. The term "operatively linked" includes having an appropriate start signal 
(e.g., ATG) in fi-ont of the DNA sequence to be expressed and maintaining the correct 
reading frame to permit expression of the DNA sequence under the control of the 
expression control sequence and production of the desired product encoded by the DNA 
sequence. If a gene that one desires to insert into a recombinant DNA molecule does not 
contain an ^ipropriate start signal, such a start signal can be inserted in front of the gene. 

The term "standard hybridization conditions" refers to salt and temperature conditions 
substantially equivalent to S.times.SSC and 65.degree. C. for hoUx hybridization and 
wash. However, one skilled in the art will appreciate that such "standard hybridization 
conditions" are dependent on particular conditions including the concentration of sodium 
and magnesium in the buffer, nucleotide sequence length and concentration, percent 
mismatch, percent formamide, and the like. Also important in the determination of 
"standard hybridization conditions'* is whether the two sequences hybridizing are RNA— 
RNA, DNA~DNA or RNA-DNA. Such standard hybridization conditions are easily 
detemfiined by one skilled in the art according to well known formulae, wherein 
hybridization is typically 10-20.degree. C. below the predicted or determined T.sub.m 
with washes of higher stringency, if desired. 
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As used herein, "pg" means picogram, "ng" means nanogram, "ug" or ".mu.g" mean 
microgram, "mg" means nnlligram, "ul" or .mu.l" mean microliter, "ml" means milliliter, 
"1" means liter. 

5 

Description 

Using a synthetic gene, efiBcient expression of HIV-1 IN in hmnan cells had been 
achieved in the past (Cherepanov et al., 2000). In the present invention, HTV-l IN protein 

10 complexes present in nuclear extracts from cells stably expressing this viral protein have 
been characterized. A HIV integrase-interacting protein that forms a distinct complex 
with IN in human cells and an insight into the oligomeric state of intracellular HIV IN, 
suggesting that the minimal cellular IN complex is a tetramer is provided in the present 
invention. It is an object of present invention to prevent or inhibit the integration of the 

15 HTV genome into the host cell chromosomes by inhibiting the interaction between the 
integrase interacting protein and integrase. 

The present invention describes the isolation, purification, identification and recombinant 
production of integrase-interacting proteins (INIPs), and methods of use thereof. These 

20 cofactors participate in the HIV-1 integration process, and can be utilized in therapeutic 
and diagnostic methods relating to the treatm^t of AIDS. Therefore, in its primary 
aspect, the present invention concerns the identification of Integrase-interacting proteins 
(INIPs). In a particular embodiment, the present invention relates to all members of the 
herein disclosed Integrase-interacting proteins (INIPs), and especially to the Integrase- 

25 inhibiting protein, INIP76. 

One such INIP has been determined to be novel cofactor having the following 
characteristics: 

a) it binds with integrase (IN) and specifically stimulates strand-transfer by IN; and 
30 b) it possesses an apparent molecular weight of approximately 76 kDa as determined by 
SDS-polyacrylamide gel electrophoresis. 
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This cofactor, the polynucleotides expressing said cofactor and analogs, derivatives or 
fragments of both can be utilized iq the various methods of the present invention, since 
they form a complex with integrase. 

One of the characteristics of the present Integrase-inhibiting protein INIP76 is its function 
5 as a cofactor that interacts with IN, thereby participating in the increase in the level of 
integration of DNA by the HIV-1 regulatory protein IN. 



The present invention also relates to a recombinant DNA molecule or cloned gene or a 
polynacleotide, or a degenerate variant thereof, which encodes an Integration-interacting 
protein (INIP), or a fragment thereof that possesses a molecular weight of about 76 kD, * 
more in particular of maximally 76kDa, more in particular an Integrase-inhibiting protein 
INIP76 (INIP76), The present invention relates in particular to a nucleic acid molecule, in 
particular a recombinant DNA molecule or cloned gene, encoding the Integrase-inhibiting 
protein INIP76 (INIP76), and which has a nucleotide sequence substantially homologous 
or is complementary to the DNA sequence of LEDGF, DFS70 or p75. 

The human and murine DNA sequences of the Integrase-inhibiting protein INIP76 of the 
present invention or portions thereof, may be prepared as probes to screen for 
complemCTtary sequences and genomic clones in the same or alternate species. The 
20 present invention extends to probes so prepared that may be provided for screening cDNA 
and genomic libraries for the Integrase-inhibiting protein INIP76 (IN1P76). For example, 
the probes may be prepared with a variety of known vectors, such as the phage .lambda, 
vector. The present invention also includes the preparation of plasmids including such 
vectors, and the use of the DNA sequences to construct vectors expressing antisense RNA 
25 or ribozymes which would attack the mRNAs of INIP76. Correspondiagly, the 
prq)aration of antisense RNA and ribozymes are included herein. 



In a further embodiment of the invention, the full DNA sequence of the recombinant 
DNA molecule or cloned gene so determined may be operatively linked to an expression 
30 control sequence which may be introduced into an appropriate host. The invention 
accordingly extends to xmicellular hosts transformed with the cloned gene or recombinant 
DNA molecule comprising a DNA sequence encoding the present Integrase-interacting 
protein, INIP76 
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According to other preferred features of certain preferred embodiments of the present 
invention, a recombinant expression system is provided to produce biologically active 
animal or himian Integration-interacting protein (INIP). 

5 Also within the scope of the present invention are DNA or RNA sequences 
(polynucleotides) having a sequence substantially homologous to the DNA sequence 
which codes LEDGF which code for a Integration-interacting protein (INIP), with an 
activity substantially that of INIP76. Mutations can be in this DNA sequences such that a 
particular codon is changed to a codon which codes for a different amino acid. Such a 

10 mutation is generally made by making the fewest nucleotide changes possible. A 
substitution mutation of this sort can be made to change an amino acid in the resulting 
protein m a non-conservative manner (i.e., by changing the codon from an amino acid 
belonging to a grouping of amino acids having a particular size or characteristic to an 
amino acid belonging to another grouping) or m a conservative manner (i.e., by changing 

15 the codon from an amino acid belonging to a grouping of amino acids having a particular 
size or characteristic to an amino acid belonging to the same grouping). Such a 
conservative change generally leads to less change in the stracture and function of the 
resulting protein. A non-conservative change is more likely to alter the structure, activity 
or function of tiie resulting protein. The present invention should be considered to include 

20 sequences containing conservative changes which do not significantly alter the activity or 
binding characteristics of the resulting protein. 

The concept of the Integration-interactiQg protein (INIP) contemplates that specific 
factors exist for correspondingly specific Ugands, such as Integration-interacting protein 

25 (INIP) and the like, as described earher. Accordingly, the exact structure of each 
Integration-interacting protein (INIP) will understandably vary so as to achieve this Ugand 
and activity specificity. It is this specificity and the direct involvement of the Integration- 
interacting protein (INIP) in the chain of events leading to the level of integration by IN 
that offers the promise of a broad spectrum of diagnostic and therapeutic utiUties for 

30 retroviral infection. 

The present invention naturally contemplates several means for preparation of the 
Integrase-interacting proteins (INIPs), including as illustrated herein known 
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recombinant techniques, and the invention is accordingly intended to cover such synthetic 
preparations within its scope. The known cDNA and mmxo acid sequences of 
LEDGF/DFS70/p75 facilitates the reproduction of the Integrase-interacting proteins 
(INIPs) by such recombinant techniques, and accordingly, the invention extends to 
expression vectors prepared from the disclosed DNA sequences for expression in host 
systems by recombinant DNA techniques, and to the resulting transfomied hosts. In 
present invention we also produced the recombinant INIP76-IN complex. 

Another feature of this invention is the expression of the DNA sequences encoding 
INIP76. As is well known m the art, DNA sequences may be expressed by operatively 
linking them to an expression control sequence in an appropriate expression vector and 
employing that expression vector to transform an appropriate unicellular host. 

Such operative linking of a DNA sequence of this invention to an expression control 
sequence, of course, includes, if not akeady part of the DNA sequence, the provision of 
an initiation codon, ATG, in the correct reading frame iq)stream of the DNA sequence. 

A wide variety of host/expression vector combinations may be employed in expressing 
the DNA sequences of this invention. Useftd expression vectors, for example, may consist 
of segments of chromosomal, non-chromosomal and synthetic DNA sequences. Suitable 
vectors include derivatives of SV40 and known bacterial plasmids, e.g., E. coli plasmids 
col El, pCRl, pBR322, pMB9 and their derivatives, plasmids such as RP4; phage DNAS, 
e.g., the numerous derivatives of phage .lambda., e.g., NM989, and other phage DNA, 
e.g., M13 and filamentous single stranded phage DNA; yeast plasmids such as the 2.mu. 
plasmid or derivatives thereof; vectors useful in eukaryotic cells, such as vectors useful in 
insect or mammalian cells; vectors derived from combinations of plasmids and phage 
DNAs, such as plasmids that have been modified to employ phage DNA or other 
expression control sequences; and the like. 

Any of a wide variety of expression control sequences-sequences that control the 
expression of a DNA sequence operatively linked to it-may be used in these vectors to 
express the DNA sequences of this invention. Such useful expression control sequences 
include, for example, the early or late promoters of SV40, CMV, vaccinia, polyoma or 
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adenovirus, the lac system, the trp system, the TAG system, the TRC system, the LTR 
system, the major operator and promoter regions of phage .lambda., the control regions of 
fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, 
the promoters of acid phosphatase (e.g., Pho5), the promoters of the yeast .alpha.-mating 
factors, and other sequences known to control the ejqjression of genes of prokaryotic or 
eukaryotic cells or their viruses, and various combinations &ereof. 

A wide variety of unicellular host cells are also useful in expressing the DNA sequences 
of this invention. These hosts may include well known eukaryotic and prokaryotic hosts, 
such as strains of E. coU, Pseudomonas, Bacillus, Streptomyces, fongi such as yeasts, and 
animal cells, such as CHO, Rl.l, B-W and L-M cells, African Green Monkey kidney 
cells (e.g., COS 1, COS 7, BSGl, BSC40, and EMTIO), insect cells (e.g., Sfi>), and 
human cells and plant cells in tissue culture. 

It will be understood that not aU vectors, expression control sequences and hosts will 
function equally weU to express the DNA sequences of this invention. Neither will all 
hosts function equally well with the same expression system. However, one skilled in the 
art will be able to select the proper vectors, expression control sequences, and hosts 
without undue experimentation to accomplish the desired expression without departing 
from the scope of this invention. For example, in selecting a vector, the host must be 
considered because the vector must function in it. The vector's copy number, the ability to 
control that copy number, and the expression of any other proteins encoded by the vector, 
such as antibiotic markers, will also be considered. 

In selecting an e5q)ression control sequence, a variety of factors will normally be 
considered. These include, for example, the relative strength of the system, its 
controllability, and its compatibility with the particular DNA sequence or gene to be 
expressed, particularly as regards potential secondary structures. Suitable unicellular hosts 
will be selected by consideration of, e.g., their compatibility with the chosen vector, their 
secretion characteristics, their ability to fold proteins correctly, and their fermentation 
requirements, as well as the toxicity to the host of the product encoded by the DNA 
sequences to be expressed, and the ease of purification of the expression products. 
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Considering these and other factors a p^son skilled in the art will be able to construct a 
variety of vector/expression control sequence/host combinations that will express the 
DNA sequences of this invention on fermentation or in large scale animal culture. 

It is further intended that the preparation of Integration-interacting protein (INIP) analogs 
or fragments may be pr^ared and derived from the respective nucleotide sequences of 
the protein complex/subunit, falls within the scope of the present invention. Analogs or 
fragments, n^iy be produced, for example, by pepsin digestion of Integration-interacting 
protein (INIP) material. Other analogs, such as muteins, can be produced by standard site- 
directed mutagenesis of Integration-interacting protein (INIP) coding sequences. Analogs 
exhibiting "Integration-interacting protein (INIP) activity" such as small molecules, 
whether functioning as promoters or inhibitors, may be identified by known in vivo 
and/or in vitro assays. 

As mentioned above, a DNA sequence encoding Integration-interacting protein (INIP) 
can be prepared synthetically rather than cloned. The DNA sequence can be designed 
with the appropriate codons for the Integration-interacting protein (INIP) amino acid 
sequence. In general, one will select preferred codons for the intended host if the 
sequence will be used for expression. The complete sequence is assembled from 
overl^ping oligonucleotides prepared by standard methods and assembled into a 
complete coding sequence. See, e.g.. Edge, Nature, 292:756 (1981); Nambair et al.. 
Science, 223:1299 (1984); Jay et al., J. Biol. Chem., 259:6311 (1984). 

Synthetic DNA sequences allow convenient construction of genes which will express 
Integration-interacting protein (INIP) analogs or "muteins". Alternatively, DNA encoding 
muteins can be made by site-directed mutagenesis of native Integration-interacting 
protein (INIP) genes or cDNAs, and muteins can be made directly using conventional 
polypeptide synthesis. 

A general method for site-specific incorporation of imnatural amino acids into proteins is 
described in Christopher J. Noren, Spencer J. Anthony-CahiU, Michael C. Griffith, Peter 
G. Schultz, Science, 244:182-188 (April 1989), This method may be used to create 
analogs with unnatural amino acids. 
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In general, techniques for nucleic acid or protein/peptide manipulation are well known. 
(See, e.g. Annual Rev. of Biochem. 1992; 61:131-156). Reagents useful in applying such 
nucleic acid manipulation techniques, such as restriction enzymes and the like, are widely 
5 known in the art and commercially available from a number of vendors. Large amounts of 
the nucleic acid sequences encoding the Inip76 or its antagonists or modulators may be 
obtained using well-established molecular biology procedures such as molecidar cloning 
and replication of the vector or plasmid carrying the sequences in a suitable host cell. 
Bither complete gene sequences or partial sequences encoding the desired Imp76 or its 

10 antagonists or modulators can be employed. The nucleic acid sequences encoding the 
Inip76 or its antagonists or modulators can also be produced in part or in total by 
chemical synthesis, e.g. by the phosphoramidite method described by Beaucage and 
Camithers (Tetra Letts. 1981; 22:1859-1862) or the triester method (Matteucci et al., J. 
Am, Chem. Soc. 1981; 103:3185) and may be performed on commercial automated 

15 oligonucleotide synthesizers. A double-stranded fragment may be obtained from the 
single-stranded product of chemical synthesis either by synthesizing the complementary 
strand and annealing the strand together under ^propriate conditions, or by synthesizing 
the complementary strand using DNA polymerase with an appropriate primer sequence. 

20 The invention includes an assay system for screening of potential drags effective to 
modulate Ihtegrase-interacting proteins (INIPs) activity of target mammalian cells by 
interrupting or potentiating the Integrase-interacting proteins (INIPs), In one instance, the 
test drag could be administered to a cellular sample with the ligand that activates the 
Integrase-interacting proteins (INIPs), or an extract containing the activated Integrase- 

25 interacting proteins (INIPs), to determine its effect upon the binding activity of the 
Integrase-interacting proteins (INIPs) to any chemical sample (including DNA or 
integrase), or to the test dmg, by comparison wdth a control. 

The assay system could more importantly be adapted to identify drags or other entities 
that are capable of binding to the Integrase-interacting proteins (INIPs), either 
30 extracellular, in the cytoplasm or in the nucleus, thereby inhibiting or potentiating the 
level of retrovirus integration action or strand-transfer activity of IN. Such an assay would 
be useful in the development of drags that would be specific against the cellular 
receptivity for integration of exogenous (in the meaning of heterologous or homologous 
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but introduced or inserted into the same natural ceU type but in an unnatural state) or of 
introduced DNA in its chromosomal DNA or that would potentiate such activity, in time 
or in level of activity. For example, such drugs might be used to decrease or prevent the 
repUcation of the HIV-1 virus in its host organism, or such drug mi^t be used to increase 
the integration of a transgene in cellular chromosomal DNA. 

In accordance with the above, an assay system for screening potential drugs effective to 
modulate the activity of the Integration-interacting protein (INIP) may be prepared. The 
Integration-interacting protein (IMP) may be introduced into a test system, and the 
prospective drug may also be introduced into the resulting cell culture, and the culture 
thereafter examined to observe any changes in the Integration-interacting protein (INIP) 
activity of the cells, due either to the addition of the prospective drug alone, or due to the 
effect of added quantities of the known Integration-interacting protein (INIP). 

In yet a further embodiment, the invention contemplates antagonists or modulators 
(increase or decrease) of the activity of a Integration-interacting protein (INIP) and their 
use, more in particular as an antiviral therapeutic or for the manufacture of a medicament 
for the prevention or treatment of a viral infection, more in particular a lentiviral 
infection, yet more in particular a HIV-infection. The invention also relates to 
compositions containing such antagonists or modulators. The invention relates in 
particular to an agent or molecule that inhibits, influences or modulates binding of 
Integrase-inhibiting protein, INIP76 to integrase. In a specific embodiment, the antagonist 
can be a peptide, more particularly a peptide having the sequence of a portion of the 
binding domain of the Integrase-inhibiting protein INIP76, which binds to the IN protein 
or forms an Imp76-IN complex. In other embodiments, the antagonists are selected from 
smaU molecules, oligonucleotides (antisense or aptamers), antigene therapeutics, smaU 
interfering RNAs (or RNA interference in general), soluble receptors, antibodies and/or 
cellular therapies. 

In a further embodiment. The present invention also relates to methods of treating or 
preventing viral infections by using antagonists or modulators of the binding of Inip76 
to IN. The present invention relates to certain therapeutic methods which would be based 
upon the activity of the Integration-interactmg protein (INIP)(s), its (or their) subunits, or 
active fragments thereof or upon agents or other drugs determined to possess the same 
activity. A first therapeutic method is associated with the prevention of the manifestations 
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of conditions causally related to or following &om the binding activity of the Integration- 
interacting protein (INIP).or its subunits, and comprises administering an agent capable of 
modulating the production and/or activity of the Integration-interacting protein QNIP) or 
subunits thereof, either individually or in mixture with each other in an amount effective 
5 to prevent the development of those conditions in the host For example, drugs or other 
binding partners to the Integration-interacting protein (INIP) or proteins may be 
administered to inhibit Integration-interacting protein (INIP) activity, as in the inhibition 
of Integration-interacting protein (INIP) in anti-HTV-l therapy. 

Also, the increase of the action of specific tyrosine phosphatases in the phosphorylation 
10 of activated ^hosphorylated) Integration-interacting protein (INIP) or proteins presents a 
method for inhibiting the activity of the Integration-interacting protein (INIP) or protein 
that would concomitantly provide a therapy for retroviral infection based on Integration- 
interacting protein (INIP)/protein inactivation. 

More specifically, the therapeutic method generally referred to herein covdd include the 
15 method for the treatment of various retroviral infection pathologies or may integration of 
a DNA in cellular chromosomal DNA by the administration of pharmaceutical 
compositions that may comprise effective inhibitors of activation of the Integration- 
interacting protein (INIP) or its subunits, or other equally effective drags developed for 
instance by a drag screening assay prepared and used in accordance with a fiirflier aspect 
20 of the present invention. 

Also, an increase of the action of specific tyrosine phosphatases in the dephosphorylation 
of activated Integrase-interacting proteins (INIPs) presents a method for inhibiting or 
decreasing the activity of the Integrase-interacting proteins (INIPs) that would 
concomitantly potentiate therapies based on Integration-interacting protein (INIP) 
25 deactivation. 

In the present invention small molecules, e.g. small organic molecules with a molecular 
mass <500 Da and other suitable molecules, can also function as antagonists or 
modulators of Inip76 in order to prevent or treat a viral infection. Small molecules and 
30 other drag candidates can readily be obtained, for example, firom combinatorial and 
natural product libraries and using methods known to the art to screen candidate 
molecules for their Iiiip76 antagonizing or modulating function. Screening assays in this 
regard are known in the art, as described for example in PCT/US99/1200L Furthermore, 
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random peptide libraries, consisting of all possible combinations of anmio acids, attached 
to a solid phase or in solution, may also be used to identify peptides that act as antagonists 
ofInip76. 

5 Suitable antagonists or modulators of the interaction of Imp76 with HTV IN can also be 
developed by known drug design methods, e.g. using structural analysis of the IN, hup76 
or the complex by employing methods established in the art, for example, using X-ray 
crystallography to analyze the stracture of the complex foraied ( see for example Sielecki, 
A.R. et al. Science 1989; 243:1346-51; Dhanaraj, V. et aL Nature 1992; 357(6377):466- 
10 72) and/or by modifying known LEDGF binding molecules i.e. "lead compounds," to 
obtain (more potent) inhibitors and compounds for different modes of administration (i.e. 
oral vs. intravenous). 

The present invention relates to antibodies directed against Inip76, In and disrupting or 
15 modulating the complex formation between Inip76 and IN and the use of said antibodies. 
The invention also relates to the development of antibodies against the Integrase- 
interacting proteins (INIPs) and to compositions containing them, including naturally 
raised and recombinantly prepared antibodies. Such antibodies could include both 
polyclonal and monoclonal antibodies prepared by known genetic techniques, as well as 
20 bi-specific (chimeric) antibodies, and antibodies including other functionality suiting 
them for additional diagnostic use conjunctive with their capability of modulating 
Integrase-inhibiting protein INIP76 (INIP76) activity. 

In particular, antibodies against specifically phosphorylated factors can be selected and 
are included withia the scope of the present invention for their particular ability in 
25 following activated protein. Thus, activity of the Integration-interacting protein (INIP) or 
of the specific polypeptides believed to be causally connected thereto may therefore be 
followed directly by the assay techniques discussed later on, through the use of an 
appropriately labelled quantity of the Integration-interacting protein (INIP) or antibodies 
or analogues thereof. 

30 

Also, antibodies including both polyclonal and monoclonal antibodies, and drugs that 
modulate the production or activity of the Integrase-interacting proteins (INIPs) and/or 
their subunits may possess certain diagnostic applications and may for example, be 
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Utilized for fhe purpose of detecting and/or measuring conditions such as HTV-l infection 
or the like. 

For example, the Integrase-interacting proteins (INIPs) or their subunits may be used to 
produce both polyclonal and monoclonal antibodies to thraiselves in a variety of cellular 
media, by known techniques such as the hybridoma technique utilizing, for example, 
fused mouse spleen lymphocytes and myeloma cells. Likewise, small molecules that 
mimic or antagonize the activity(ies) of the Integrase-interacting proteins (INIPs) of the 
invention may be discovered or synthesized, and may be used in diagnostic and/or 
therapeutic protocols. 

The general methodology for making monoclonal antibodies by hybridomas is well 
known. Immortal, antibody-producing cell lines can also be created by techniques other 
than fusion, such as direct transformation of B lymphocytes with oncogenic DNA, or 
transfection with Epstein-Barr virus. See, e.g., M. Schreier et al., "Hybridoma 
Techniques" (1980); Hammerlinig et al., "Monoclonal Antibodies And T-cell 
Hybridomas" (1981); Kennett et al., "Monoclonal Antibodies" (1980); see also, U.S. Pat. 
Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,451,570; 4,466,917; 4,472,500; 
4,491,632; 4,493,890. 

Methods for producing monoclonal anti-Integration-interacting protein (INIP) antibodies 
are also well-known in the art. See Niman et al., Proc. Natl. Acad. Sci. USA, 80:4949- 
4953 (1983). Typically, the present Litegration-interacting protein (INIP) or a peptide 
analog is used either alone or conjugated to an immimogenic carrier, as the immimogen in 
the before described procedure for producing anti-Integration-interacting protein (INIP) 
monoclonal antibodies. The hybridomas are screened for the ability to produce an 
antibody that immunoreacts with the Integration-interacting protein (INIP) peptide analog 
and the present Integration-interacting protein (INIP). 

Panels of monoclonal antibodies produced against Integration-interacting protein (INIP) 
peptides can be screened for various properties; i.e., isotope, epitope, affinity, etc. Of 
particular interest are monoclonal antibodies that neutralize the activity of the Integration- 
interacting protein (INIP) or its subunits. Such monoclonals can be readily identified in 
Integration-interacting protein (INIP) activity assays. High afBnity antibodies are also 
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iiseful when inamimoafiBbiity purification of native or recombinant Integration-interacting 
protein (INIP) is possible. 

Methods for producing polyclonal anti-polypeptide antibodies are well-known in the art. 
See U.S. Pat. No. 4,493,795 to Nestor et al. A monoclonal antibody, typically containing 
5 Fab and/or F(ab*).sub.2 portions of useful antibody molecules, can be prepared using the 
hybridoma technology described in Antibodies-A Laboratory Manual, Harlow and Lane, 
eds.. Cold Spring Harbor Laboratory, New York (1988), which is incorporated herein by 
reference. Briefly, to form the hybridoma firom which the monoclonal antibody 
composition is produced, a myeloma or other self-perpetuating cell line is fused with 
10 lymphoc3rtes obtained firom the spleen of a mammal hyperimmunized with a Integration- 
interacting protein (INIP)-biading portion thereof, or Integration-interacting protein 
(INIP), or an origin-specific DNA-binding portion thereof 

Splenocytes are typically fused with myeloma cells using polyethylene glycol (PEG) 
6000. Fused hybrids are selected by their sensitivity to HAT. Hybridomas producing a 
15 monoclonal antibody useful in practicing this invention are identified by their ability to 
immimoreact with the present Integration-interacting protein (INIP) and their abiUty to 
inhibit specified INIP activity in target cells. 

A monoclonal antibody usefiil in practicing the present invention can be produced by 
initiating a monoclonal hybridoma culture comprising a nutrient medium containing a 
h34>ridoma that secretes antibody molecules of the appropriate antigen specificity. The 
culture is maintained under conditions and for a time period sufficient for the hybridoma 
to secrete the antibody molecules into the medium. The antibody-containing medium is 
then collected. The antibody molecules can then be further isolated by well-known 
techniques. 

Media usefixl for the preparation of these compositions are both well-known in the art and 
commercially available and include synthetic culture media, inbred mice and the Uke. An 
exemplary synthetic medium is Dulbecco's minimal essential medium (DMBM; Dulbecco 
30 et aL, Virol. 8:396 (1959)) supplemented with 4.5 gm/1 glucose, 20 mm glutamine, and 
20% fetal calf serum. An exemplary inbred mouse strain is the Balb/c. 
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Inhibition of expression of a Inip76 can be desirable to treat or prevent viral infections. 
Where inhibition of expression of Imp76 is desirable, inhibitory nucleic acid sequences 
that interfere with expression of Inip76 at the transcriptional or translational level can also 
be used. The strategy called antisense, antigene or RNA-interference can be applied. 
These approaches utilise, for example, antisense nucleic acids, ribozymes, triplex agents 
or siRNAs to block transcription or translation of Ihip76 mRNA or DNA or of a specific 
mRNA or DNA of Inip76, either by masking that mRNA with an antisense nucleic acid 
or DNA with a triplex agent, by cleaving the nucleotide sequence with a ribozyme or by 
destruction of the mRNA tiirough a complex mechanism involved in RNA-interference. 
The present invention extends to antisense oligonucleotides, ribozymes, RNA interference 
and antigene therapeutics, their use and their preparation. 

Antisense nucleic acids are DNA or RNA molecules or nucleic acid analogs (e.g. hexitol 
nucleic acids. Peptide nucleic acids) that are complementary to at least a portion of a 
specific mRNA molecule (Weintraub Scientific American 1990; 262:40). In the cell, the 
antisense nucleic acids hybridise to the corresponding mRNA, forming a double-stranded 
molecule. The antisense nucleic acids interfere with the translation of the mRNA, since 
the cell will not translate a mRNA that is double-stranded. Antisense oligomers of about 
15 nucleotides are preferred, since they are easily synthesised and are less likely to cause 
problems than larger molecules when introduced into the target cell which produces 
Inip76. Also nucleic acids or analogs, complementary to the translation initiation site, e.g. 
between -10 or +10 regions of the Inip76 nucleotide sequence, are preferred. 
The potency of antisense oligonucleotides for inhibiting Inip76 may be enhanced using 
various methods including addition of polylysine, encapsulation into liposomes (antibody 
targeted, cationic acid, Sendai virus derived, etc.) or into nanoparticels in order to deliver 
the oligonucleotides into cells. Other techniques for enhancing the antisense capacity of 
ohgonucleotides exist, such as the conjugation of the antisense oligonucleotides for 
example to "cell penetrating peptides" (Manoharan, M. Antisense Nucleic Acid Drug 
Dev. 2002; 12(2): 103-128 / Juliano, R..L. Curr. Opin. Mol. Ther. 2000; 2(3): 297-303). 

Use of for example an oligonucleotide or a PNA (Peptide nucleic acid) to stall 
transcription is known as the antigene strategy (e.g. triplex formation) In the case of 
ohgonucleotides, the oUgomer winds around double-helical DNA (major groove). 
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fonning a three-stranded helix. Therefore, these antigene compounds can be designed to 
recognise a unique site on a chosen gene and block transcription of that gene in vivo. 
(Maher et al. Antisense Res. and Dev. 1991; 1:227; Helene. C. Anticancer Drug Design 
1991; 6:569 / Casey, B.P. et al. Prog. Nucleic Acid Res. Mol. Biol. 2001; 67: 163-192 / 
Pooga, M. et al. Biomol. Eng. 2001; 17(6): 183-192 / Nielsen, P.E. Phannacol. Toxicol. 
2000; 86(1): 3-7). Antigene oUgonucleotides as wel as PNAs are easily synthesised by the 
man skilled in the art and are even conunercially available. 

Ribozymes are molecules possessing the abihty to specificaUy cleave other single 
stranded RNA in a manner analogous to DNA restriction endonucleases. Through the 
modification of nucleotide sequences which encode these RNAs, it is possible to engineer 
molecules that recognise specific nucleotide sequences in an RNA molecule and cleave it 
(Cech J. Amer. Med. Assn. 1988; 260:3030). A major advantage of this approach is that, 
because they are sequence-specific, only mRNAs with particular sequences are 
inactivated. 

There are two basic types of ribozymes namely, tetrahymena-type (Hasselhofif (1988) 
Nature 334:585) and "hammerhead"-type. Tetrahymena-type ribozymes recognise 
sequences which are four bases in length, while "hammerhead"-type ribozymes recognise 
base sequences 1 1-18 bases in length. The longer the recognition sequence, the greater the 
likeUhood that the sequence will occur exclusively in the target mRNA species. 
Consequently, hammerhead-type ribozymes are preferable to tetrahymena-type ribozymes 
for inactivating a specific mRNA species and 18-based recognition sequences are 
preferable to Sorter recognition sequences. 

Specific ribozyme cleavage sites wilhin any potential RNA target are initially identified 
by scanning the target molecule for ribozyme cleavage sites which include the following 
sequences. GUA, GUU and GUC. Once identified, short RNA sequences of between 15 
and 20 ribonucleotides corresponding to the region of the target gene containing the 
cleavage site may be evaluated for predicted structural features such assecondary 
stracture that may render the oUgonucleotide sequence unsuitable. The suitabihty of 
candidate targets may also be evaluated by testing their accessibihty to hybridization with 
complementary oligonucleotides, using ribonuclease protection assaj^. 
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RNA-interference involves the insertion of small pieces of double stranded (ds) and even 
single stranded RNA into a cell. If the dsRNA corresponds with a gene in the cell, it will 
promote the destruction of mRNA produced by that gene, thereby preventing its 
expression. The technique has been shown to work on a variety of genes, even in himian 
cells and in vivo. For example small interfering RNAs (siRNA), short-haiipin RNAs 
(shRNA) or vectors expressing such nucleic acids can be ^plied in flie RNA-interference 
strategy in order to inhibit the translation of Inip76-mRNA. 

Therefore, the present invention relates to flie use of nucleic acids mediating RNA 
interferrace specific for mRNA of a protein of the hepatoma-derived growfli factor 
family, for the modulation of the nuclear transport of lentiviral integrase and to the use of 
nucleic acids mediating RNA interference specific for mRNA of a protein of the 
hepatoma-derived growth factor family, for the manufacture of a medicament for the 
prevention and/or treatment of a viral infection. This includes the use of siRNA, shRNA 
and vectors expressing nucleic acids for RNA interference. 

Antisense RNA, DNA molecules and analogs, ribozymes, antigene compounds or nucleic 
acids for RNA interference of the invention may be prepared by any method known in ttie 
art for the synthesis of nucleic acid molecules. These include techniques for chemically 
synthesizing oligonucleotides well known in the art such as for example solid phase 
phosphoramidite chemical syntiiesis. Alternatively, RNA molecules may be generated by 
in vitro and in vivo transcription of DNA sequences encoding the antisense RNA 
molecule. The DNA sequences described herein may thus be used to prepare such 
molecules. Such DNA sequences may be incorporated into a wide variety of vectors 
which incorporate suitable RNA polymerase promoters such as the T7 or SP6 polymerase 
promoters. Alternatively, antisense cDNA constructs that synthesize anti-sense RNA 
constitutively or inducibly, depending on the promoter used, can be introduced stably into 
cell lines 

Suppression function of Inip76 can also be achieved through administration of variant 
polypeptide (dominant negative variant form) of Inip76, or a nucleotide sequence 
encoding variant polypeptide of Inip76. By administering an Inip76 variant polypeptide 
or a nucleotide sequence encoding such polypq)tide, the variant will compete with wild- 
type Ihip76 for binding to its receptor. 
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Another aspect of the present invention is the use of gene transfer, including gene 
therapy, to deliver above mentioned molecules antagonising or modulating the binding of 
Imp76 to IN. Gene therapy means the treatment by the delivery of therapeutic nucleic 
5 acids to patient's cells. This is extensively reviewed in Lever and Goodfellow 1995; Br, 
Med BulL,51, 1-242; Culverl995; Ledley, F.D. 1995. Hum. Gene Ther. 6, 1129. By gene 
transfer, a nucleic acid encodiug a Inip76-antagonising agent is introduced into cells in a 
subject to express the Imp76-antagonist and mhibit the hiip76 function. To achieve gene 
therapy there must be a method of delivering genes to the patient's cells and additional 

10 methods to ensure the efifective production of any therapeutic genes. There are two 
general approaches to achieve gene delivery; these are non-viral delivery and virus- 
mediated gene delivery. Vectors as described herein could be sued for this purpose. 
In one embodiment of the invention, nucleic acid encoding a antagonist or modulator of 
Inip76 is introduced in a subject in order to express the antagonist or modulator and 

15 prevent or treat a viral infection. For gene transfer, the key steps are 1) to select the mode 
of delivery, e.g. a proper vector for deUvery of the inhibitor genes to the subject, 2) 
administer the nucleic acid to the subject; and 3) achieve appropriate expression of the 
transferred gene for satisfactory durations. Methods for gene transfer are known in the art. 
The method described below are merely for purposes of illustration and are typical of 

20 those that can be used to practice the invention. However, other procedures may also be 
employed, as is understood in the art. Most of the techniques to construct delivery 
vehicles such as vectors and the like are widely practiced in the art, and most practitioners 
are familiar with the standard resource materials which describe specific conditions, 
reagents and procedures which are described in the literature. 

25 

In a specific embodiment of the invention, it is clear that the preventive or treatment 
methods or that the antagonists or modulators of Inip76 as described herein, can also be 
used in combination with any therapy or compound known in the art for viral infections, 
more in particular HIV infections. The invention also relates to synergistic combinations, 
30 especially combinations with integrase inhibitors. Possible anti-viral agents for inclusion 
into the antiviral compositions or combined preparations of this invention include, for 
instance, interferon-alfa (either pegylated or not), nucleoside reverse transcriptase (RT) 
inhibitors (i.e. zidovudine, didanosine, stavudine, lamivudine, zalcitabine and abacavir), 
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non-nucleoside reverse transcriptase inhibitors (i.e. nevirapine, delavirdine and 
efavirenz), protease inhibitors (i.e. saquinavir, indinavir, ritonavir, nelfinavir, amprenavir 
and lopinavir), integrase inhibitors such as the pyranodipyriniidines, L-708,906 or Si- 
1360, fiision inhibitor enfuvirtide, ribavirin, vidarabine, acyclovir, gancyclovir, 
5 amantadine, rimantadine and other selective inhibitors of the replication of HIV. The 
composition of the invention can also contain dmgs having a general beneficial activity 
for virally infected mammals, such as antibiotics or corticosteroids. 

The diagnostic utility of the present invention extends to the use of the present Integrase- 
10 interacting proteins (INIPs) in assays to screen for Integrase-interacting proteins (INIPs), 
to assay the infectability of a subject by a retroviral or measuring conditions such as HIV- 
1 infection or the like. 

In a particular embodiment, antibodies and other antagonists or modulators of Inip76 can 
be used for this pxirpose. The Integrase-interacting proteins (INIPs), their analogs and/or 
15 analogs, and any antagonists or antibodies that may be raised thereto, are capable of use 
in connection with various diagnostic techniques, including immunoassays, such as a 
radioimmunoassay, using for example, an antibody to the Integration-interacting protein 
(INIP) that has been labeled by either radioactive addition, or radioiodination. 

20 In an immunoassay, a control quantity of the antagonists or antibodies thereto, or the like 
may be prepared and labeled with an enzyme, a specific binding partner and/or a 
radioactive element, and may then be introduced into a cellular sample. 
After the labeled material or its binding partner(s) has had an opportunity to react with 
sites within the sample, the resulting mass may be examined by known techniques, which 

25 may vary with the nature of the label attached. For example, antibodies against 
specifically phosphorylated factors may be selected and appropriately employed in the 
exemplary assay protocol, for the purpose of following activated protein as described 
above. 

In the instance where a radioactive label, such as the isotopes .sup.^ H, .sup.^"* C, .sup,^^ P, 
30 .sup.^^ S, .sup.^^ CI, .sup.^^ Cr, .sup.^^ Co, .sup.^^ Co, .sup.^^ Fe, .sup.^^ Y, .sup.^^^ I, 
.sup.^-^^ I, and .sup.^^^ Re are used, known currently available counting procedures may be 
utilized. In the instance where the label is an enzyme, detection may be accomplished by 
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any of the presentiy utilized colorimetric, spectrophotometric, fluorospectrophotometric, 
amperometric or gasometric techniques known in the art. 



The present mvention includes an assay system which may be prepared in the form of a 
5 test kit for the quantitative analysis of the receptivity of a test subject for retroviral 
infection or to measuring conditions such as HIV-1 infection or the like by the extent of 
the presence of the Litegration-interacting protein (INIP), or to identify drugs or other 
agents that may mimic or block their activity. The system or test kit may comprise a 
labeled component prepared by one of the radioactive and/or enzymatic techniques 
10 discussed herein, coupling a label to the Integration-interacting protein (INIP), their 
agonists and/or antagonists, and one or more additional immunochemical reagents, at 
least one of which is a free or immobilized ligand, capable either of binding with the 
labeled component, its binding partner, one of the components to be determined or their 
binding partner(s). 

15 

Particularly, the anti-Integration-interacting protein (INIP) antibody used in the diagnostic 
methods of this invention is an affinity purified polyclonal antibody. More preferably, the 
antibody is a monoclonal antibody (mAb). In addition, it is preferable for the anti- 
Integration-interacting protein (INIP) antibody molecules used herein be in the form of 
20 Fab, Fab', F(ab').sub.2 or F(v) portions of whole antibody molecules. 

As suggested earlier, the diagnostic method of tiie present invention comprises examining 
a cellular sample or medium by means of an assay including an effective amount of an 
antagonist to a Integration-interacting protein (INIP)/protein, such as an anti-Ihtegration- 

25 interacting protein (INIP) antibody, preferably an afiSnity-purified polyclonal antibody, 
and more preferably a mAb. In addition, it is preferable for the anti-Integration- 
interacting protein (INIP) antibody molecules used herein be in the form of Fab, Fab*, 
F(ab').sub.2 or F(v) portions or whole antibody molecules. As previously discussed, 
patients capable of benefiting fi-om this method include those suffering from an HTV-l 

30 infection or other like pathological derangement. Methods for isolating the Integration- 
interacting protein (INIP) and induciug anti-Integration-interacting protein (INIP) 
antibodies and for determining and optimizing the ability of anti-Ihtegration-interacting 
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protein (INIP) antibodies to assist in the examination of the target cells are all well-known 
in the art. 

The possibilities both diagnostic and therapeutic that are raised by fhe existence of tiie 
5 Integration-interacting protein (UMIPs), derive from the fact that the factors ^pear to 
participate in direct and causal protein-protein interaction between the Integration- 
interacting protein (INIPs), and those factors fliat fliereafter interact with IN and thereby 
increase inetgration. As suggested earlier and elaborated further on herein, the present 
invention contemplates pharmaceutical intervention in the cascade of reactions in which 
10 the Integration-interacting protein (INIPs) are implicated, to modulate the activity 
initiated by the Integration-interacting protein (INIPs). 

Thus, in instances where it is desired to reduce or inhibit the (resulting from a particular 
stimulus or factor, an appropriate inhibitor of the Integration-interacting protein (DSTIP) 
could be introduced to block the iateraction of the Integration-interacting protein (DSTIP) 
15 with IN thereby affecting its strand-transfer activity. 

The present invention also relates to a variety of diagnostic ^plications, including 
methods for detecting the presence of stimuli such as the earlier discussed INIP, by 
reference to their ability to elicit the specific activities which are mediated by the 
20 Integration-interacting protein (INIP). 

As mentioned earlier, the Integration-interacting protein (INIP) or the protein INIP 
interacts with can be used to produce antibodies to by a variety of known techniques (e.g. 
hybridoma techniques), and such antibodies could then be isolated and utilized as in tests 
25 for determining the presence of particular Integration-interacting protein (INIP) activity in 
suspect target cells. For convenience, the antibody(ies) to the Integration-interacting 
protein (INIP) will be referred to herein as Ab.sub.l and antibody(ies) raised in another 
species as Ab.sub.2. 

30 The presence of hitegration-interacting protein (INIP) in cells can be ascertained by the 
usual immimological procedures applicable to such determinations. A number of useful 
procedures are known. Three such procedures which are especially usefiiU utilize either 
the Integration-interacting protein (INIP) labeled with a detectable label, antibody 
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Ab.sub.2. labeled witii a detectable label, or antibody Ab.sub.2 labeled with a detectable 
label. The procedures may be summarized by the foUowing equations wherein the asterisk 
indicates that the particle is labeled, and INIP stands for the Integration-interacting 
protein: 

5 A. INIP*+Ab,sub.l INIP*Ab.sub.l 

B. INIP+Ab*=TIPAb.sub.l * 

C. INIP+Ab.sub.l +Ab.sub.2 *=TIPAb.sub.l Ab.sub.2 * 



The procedures and their application are all familiar to those skilled in the art and 
10 accordingly may be utilized within the scope, of the present invention. The "competitive" 
procedure. Procedure A, is described in U.S. Pat. Nos. 3,654,090 and 3,850,752. 
Procedure C, the "sandwich" procedure, is described in U.S. Pat. Nos. RE 31,006 and 
4,016,043. Still other procedures are known such as the "double antibody," or "DASP" 
procedure. 

15 In each instance, the Integration-interacting protein (INIP) forms complexes with one or 
more antibody(ies) or binding partners and one member of the complex is labeled with a 
detectable label. The fact that a complex has formed and, if desired, the amount thereof, 
can be determined by known methods applicable to the detection of labels. 
It will be apparent from the above, that a characteristic property of Ab.sub.2 is that it will 

20 react with Ab.sub.l. This is because Ab.sub.l raised in one mammalian species has been 
used m another species as an antigen to raise the antibody Ab.sub.2. For example, 
Ab.sub.2 may be raised in goats using rabbit antibodies as antigens. Ab.sub.2 therefore 
would be anti-rabbit antibody raised in goats. For purposes of this description and claims, 
Ab.sub.l will be referred to as a primary or anti-Integration-interacting protein (INIP) 

25 antibody, and Ab,sub.2 will be referred to as a secondary or anti-Ab.sub.l antibody. 

The labels most commonly employed for these studies are radioactive elements, enzymes, 
chemicals which fluoresce when exposed to ultraviolet light, and others. 
Radioactive labels can be detected by any of the currentiy available coxmting procedures. 
30 The preferred isotope may be elected from 3H, .sup.^"^ C, .sup.^^ P, .sup.^^ S, .sup.^^ CI, 
.sup.^^ Cr, .sup.^^ Co, .sup.^^ Co, ,sup.^^ Fe, .sup.^° Y, .sup.^^^ I, .sup.^^^ I, and .sup.^^^ Re. 
Enzyme labels are likewise useful!, and can be detected by any of the presently utilized 
calorimetric, spectrophotometric, fluorospectrophotometric, amperometric or gasometric 
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techniques. The enzyme is conjugated to the selected particle by reaction with bridging 
molecules such as carbodiimides, diisocyanates, glutaraldehyde and the like. Many 
enzymes which can be used in these procedures are known and can be utilized. The 
preferred are peroxidase, .beta.-glucuromdase, .beta.-D-glucosidase, .beta.-D- 
5 galactosidase, urease, glucose oxidase plus peroxidase and alkaline phosphatase. U.S. Pat. 
Nos. 3,654,090; 3,850,752; and 4,016,043 are referred to by way of exanq)le for their 
disclosure of alternate labeling material and methods. 

A number of fluorescent materials are known and can be utilized as labels. These include, 
for example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue and Lucifer 
10 Yellow. A particular detecting material is anti-rabbit antibody prepared in goats and 
conjugated with fluorescein through an isothiocyanate. 

A particular assay system developed and utilized in accordance with the present 
invention, is known as a receptor assay. In a receptor assay, the material to be assayed is 
15 appropriately labeled and then certain cellular test colonies are inoculated with a quantity 
of both the labeled and unlabeled material after which binding studies are conducted to 
determine the extent to which the labeled material binds to the cell recqptors. In this way, 
differences in afGnity between materials can be ascertained. 

20 Accordingly, a purified quantity of the Integration-interacting protein (INIP) may be 
radiolabeled and combined, for example, with antibodies or other inhibitors thereto, after 
which binding studies would be carried out. Solutions would then be prepared that 
contain various quantities of labeled and unlabeled uncombined Integration-interacting 
protein (INIP), and cell samples would then be inoculated and thereafter incubated. The 

25 resulting cell monolayers are then washed, solubilized and then coimted in a gamma 
counter for a length of time sufficient to yield a standard error of <5%. These data are 
then subjected to Scatchard analysis after which observations and conclusions regarding 
material activity can be drawn. While the foregoing is exemplary, it illustrates the mannw 
in which a receptor assay may be performed and utilized, in the instance where the 

30 cellular binding ability of the assayed material may serve as a distinguishing 
characteristic. 
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An assay usefiil and contemplated in accordance with fhe present invention is known as a 
"cis/trans" assay. Briefly, this assay employs two genetic constructs, one of which is 
typically a plasmid that continually expresses a particular receptor of interest when 
transfected into an appropriate cell line, and the second of which is a plasmid that 
expresses a reporter such as luciferase, under the control of a receptor/ligand complex. 
Thus, for example, if it is desired to evaluate a componnd as a ligand for a particular 
receptor, one of the plasmids would be a construct that results in expression of the 
receptor in the chosen cell line, while the second plasmid would possess a promoter 
linked to the luciferase gene in which the response element to the particular receptor is 
inserted. If the compound under test is an agonist for the receptor, the ligand will complex 
with the receptor, and the resulting complex will bind the response element and mitiate 
transcription of the luciferase gene. The resulting chemiluminescence is then measured 
photometrically, and dose response cmves are obtained and compared to those of known 
ligands. The foregoing protocol is described in detail in U.S. Pat. No. 4,981,784 and PCT 
International Publication No. WO 88/03168, for which purpose the artisan is referred. 

In a further embodiment of this invention, commercial test kits suitable for use by a 
medical specialist may be prepared to determine the presence or absence of 
predetermined Integration-interacting protein (INIP) activity or predetemiined 
Integration-interacting protem (INIP) activity capability in suspected target cells. In 
accordance with the testmg techniques discussed above, one class of such kits will 
contain at least the labeled Integration-interacting protein (INIP) or its binding partner, for 
instance an antibody specific thereto, and directions, of course, depending upon the 
method selected, e.g., "competitive," "sandwich," 'T)ASP" and the like. The kits may also 
contain peripheral reagents such as buffers, stabilizers, etc. 

Accordingly, a test kit may be prepared for the demonstration of the presence or 
capability of cells for predetemiined Integration-interacting protein (INIP) activity, 
comprising: 

a) a predetermined amoimt of at least one labeled immunochemically reactive component 
obtained by the direct or indirect attachment of the present Integration-interacting protein 
(INIP) factor or a specific binding partner thereto, to a detectable label; 

b) other reagents; and 
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More specijBcally, the diagnostic test kit may comprise: 

a) a known amoxmt of the Integration-interacting protein (ESflP) as described above (or a 
5 binding partner) generally bound to a solid phase to form an inmiunosorbent, or in the 

alternative, bound to a suitable tag, or plural such end products, etc. (or their binding 
partners) one of each; 

b) if necessary, ofh^ reagents; and 

c) directions for use of said test kit. 

10 

In a further variation, the test kit may be prepared and used for the purposes stated above, 
which operates according to a predetermined protocol (e.g. "competitive," "sandwich," 
"double antibody," etc.), and comprises: 

a) a labeled component which has been obtained by coupling the hitegration-interacting 
15 protem QNJP) to a detectable label; 

b) one or more additional immunochemical reagents of which at least one reagent is a 
ligand or an immobilized ligand, which ligand is selected from the group consisting of: 

(i) a ligand capable of binding with the labeled component (a); 

(ii) a ligand capable of binding with a binding partner of the labeled component (a); 
20 (iii) a ligand capable of binding with at least one of the component(s) to be 

determined; and 

(iv) a ligand capable of binding with at least one of the binding partners of at least 
one of the component(s) to be determined; and 

c) directions for the perfoimance of a protocol for the detection and/or detemiination of 
25 one or more components of an immunochemical reaction between the Integration- 
interacting protein (INIP) and a specific binding partner thereto. 

In a specific ambodiment, the present invention also relates to a method of increasing the 
integration of a certaia polycucleotides. For this reason Inip proteins, more in particular 
30 proteins of the family of the hepatoma-derived growth factors, yet more in particular 
Inip76, analogs, derivatives or fragments thereof, polynucleotides or vectors expressing 
them can be used. This can be used in combination with any method of gene therapy. 
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In a particular embodiment, the Integrase-interacting proteins with a sequence 
substantially homologous to LEDGF, their antibodies, agonists, antagonists, modulators 
or active firagments thereof, could be prepared in pharmaceutical formulations or 
compositions for administration in instances wherein anti-HIV therapy is ^>propriate. 
5 Such compounds may thus be prepared in pharmaceutical compositions, with a suitable 
carrier and at a strength effective for administration by various means to a patirat 
experiencing an adverse medical condition associated with HIV-1 infection for the 
treatment thereof. A variety of adnmustrative techniques may be utilized, among them 
parenteral techniques such as subcutaneous, intravenous and intraperitoneal injections, 

10 catheterizations and the like. Average quantities of the antagonist of the Integration- 
interacting protein (IMP) or their subunits may vary and in particular should be based 
upon the reconmiendations and prescription of a quaUfied physician or veterinarian. The 
formiilations include those suitable for oral, rectal, nasal, topical (including buccal and 
sublingual), vaginal or parenteral (including subcutaneous, intramuscular, intravenous, 

15 intradermal, intrathecal and epidural) administration. The formulations may conveniently 
be presented in unit dosage form and may be prepared by any of the methods well known 
in the art of pharmacy. Such methods include the step of bringing into association the 
active ingredient with the carrier which constitutes one or more accessory ingredients. In 
general the formulations are prepared by uniformly and intimately bringing into 

20 association the active ingredient with liquid carriers or finely divided solid carriers or 
both, and then, if necessary, shaping the product. 

The present invention further contemplates therapeutic compositions useful in practicing 
tiie therapeutic methods of this invention. A subject therapeutic composition includes, in 

25 admixture, a phannaceutically acceptable excipient (carrier) and one or more of a 
Integration-interacting protein (INIP), polypeptide analog thereof or fragment thereof, as 
described herein as an active ingi'edient. In a preferred embodiment, the composition 
comprises an antigen capable of modulating the specific binding of the present 
Integration-interacting protein (DSflP) within a target cell. 

30 The present invention also relates to pharmaceutical compositions including one or more 
compounds selected fi'om the molecules (antagonists or modulators) which comprise a 
region specifically binding to a protein of the hepatoma-derived growth factor family or 
nucleic acids encoding said protein of the hepatoma-derived growth factor family, for the 
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manufacture of a medicament for the prevention and/or treatment of a viral infection, 
more particularly selected from antibodies, antisens oligonucleotides, ribozymes, antigene 
therapeutics, small molecules, variant polypeptides, small interfering KNAs or vectors 
used for RNA interference. 

5 

The preparation of therapeutic compositions which contain polypeptides, analogs or 
active fragments, other antagonist or modulators as active ingredients is well understood 
in the art. Such compositions can be prepared as injectables, either as liquid solutions or 
suspensions, however, solid forms suitable for solution in, or suspension in, Uquid prior to 

10 injection can also be prepared. The preparation can also be emulsified. The active 
therapeutic ingredient is often mixed with excipients which are pharmaceutically 
acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof 
In addition, if desired, the composition can contain minor amounts of auxiliary substances 

15 such as wetting or emulsifying agents, pH buffering agents which enhance the 
effectiveness of the active ingredient. 



A polypeptide, analog or active fragment can be formulated into the therapeutic 
composition as neutralized pharmaceutically acceptable salt forms. Pharmaceutically 

20 acceptable salts include the acid addition salts (formed with the free amino groups of the 
polypeptide or antibody molecule) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed from the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 

25 calcium, or fiaric hydroxides, and such organic bases as isopropylaxnine, trimethylamine, 
2-efhylamino ethanol, histidine, procaine, and the like. 

The therapeutic polypeptide-, analog- or active fragment-containing compositions are 
conventionally administered intravenously, as by injection of a imit dose, for example. 
30 The term "imit dose" when used in reference to a therapeutic composition of the present 
invention refers to physically discrete units suitable as unitary dosage for humans, each 
unit containing a predetermined quantity of active material calculated to produce the 
desired therapeutic effect in association with the required diluent; i.e., carrier, or vehicle. 
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The compositions are administered in a maimer compatible with the dosage formulation, 
and in a therapeutically effective amount. The quantity to be administered depends on the 
subject to be treated, capacity of the subject's immune system to utilize the active 
ingredient, and degree of inhibition or neutralization of Integration-interacting piotem 
(INIP) binding capacity desired. Precise amounts of active ingredient required to be 
administered depend on the judgment of the practitioner and are peculiar to each 
individual. However, suitable dosages may range fiom about 0.1 to 20, preferably about 
0.5 to about 10, and more preferably one to several, milligrams of active ingredient per 
kUogram body weight of individual per day and depend on the route of administration. 
Suitable regimes for initial administration and booster shots are also variable, but are 
typified by an initial administration followed by repeated doses at one or more horn- 
intervals by a subsequent injection or other administration. Alternatively, continuous 
intravenous infusion sufficient to maintain concentrations of ten nanomolar to ten 
micromolar in the blood are contemplated. 

Suitable pharmaceutical carriers for use in said pharmaceutical compositions and their 
formulation are well known to those skilled in the art, and there is no particular restriction 
to their selection within tibie present invention. Suitable carriers or excipients known to the 
skilled man are saline. Ringer's solution, dextrose solution. Hank's solution, fixed oils, 
ethyl oleate, 5% dextrose in saline, substances that enhance isotonicity (such as sugars or 
sodium chloride) and chemical stability, buffers and preservatives. Otiier suitable earners 
include any carrier that does not itself induce the production of antibodies harmful to the 
individual receiving the composition such as proteins, polysaccharides, polylactic acids, 
polyglycolic acids, polymeric amino acids and amino acid copolymers. They may also 
include additives such as wetting agents, dispersing agents, stickers, adhesives, 
emulsifying agents, solvents, coatings, antibacterial and antifungal agents (for example 
phenol, sorbic acid, chlorobutanol) and the like, provided the same are consistent with 
pharmaceutical practice, i.e. carriers and additives which do not create permanent damage 
to mammals. The pharmaceutical compositions of the present invention may be prepared 
in any known manner, for instance by homogeneously mixing, coating and/or grinding 
the active ingredients, in a one-step or multi-steps procedure, with the selected carrier 
material and, where appropriate, the other additives such as surface-active agents may 
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also be prepared by inicronisation, for instance in view to obtain them in the foim of 
microspheres usually having a diameter of about 1 to 10 jim, namely for the manufacture 
of xnicroc£^sules for controlled or sustained release of tiie active ingredients. 

Suitable surface-active agents to be used in the pharmaceutical compositions of the 
present invention are non-ionic, cationic and/or anionic materials having good 
emulsifying, dispersing and/or wetting properties. Suitable anionic surfactants include 
both water-soluble soaps and water-soluble synthetic surface-active agents. Suitable soaps 
are alkaline or alkaline-earth metal salts, unsubstituted or substituted ammonium salts of 
higher fatty acids (CIO-C22), e.g. the sodium or potassium salts of oleic or stearic acid, or 
of natural fatty acid mixtures obtainable form coconut oil or tallow oil. Synthetic 
surfactants include sodium or calcium salts of polyacrylic acids; fatty sulphonates and 
sulphates; sulphonated benzimidazoie derivatives and alkylarylsulphonates. Fatty 
sulphonates or sulphates are usually in the form of alkaline or alkaline-earth metal salts, 
imsubstituted ammonium salts or anunonium salts substituted with an alkyl or acyl radical 
having from 8 to 22 carbon atoms, e.g. the sodium or calciimi salt of lignosulphonic acid 
or dodecylsulphonic acid or a mixture of fatty alcohol sulphates obtained from natural 
fatty acids, alkaline or alkaUne-earth metal salts of sulphuric or sulphonic acid esters 
(such as sodium lauryl sulphate) and sulphonic acids of fatty alcohol/ethylene oxide 
adducts. Suitable sulphonated benzimidazole derivatives preferably contain 8 to 22 
carbon atoms. Examples of alkylarylsulphonates are the sodium, calcium or alcanolamine 
salts of dodecylbenzene sulphonic acid or dibutyl-naphtalenesulphonic acid or a 
naphtalene-sulphonic acid/formaldehyde condensation product. Also suitable are the 
corresponding phosphates, e.g. salts of phosphoric acid ester and an adduct of p- 
nonylphenol with ethylene and/or propylene oxide, or phospholipids. Suitable 
phospholipids for this purpose are the natural (originating from animal or plant cells) or 
synthetic phospholipids of the cephalin or lecithin type such as e.g. 
phosphatidyleflianolamine, phosphatidylserine, phosphatidylglycerine, lysolecitfain, 
cardioHpin, dioctanylphosphatidyl-choUne, dipalmitoylphoshatidyl -choline and their 
mixtures. 

Suitable non-ionic surfactants include polyethoxylated and polypropoxylated derivatives 
of alkylphenols, fatty alcohols, fatty acids, aliphatic amines or amides containing at least 
12 carbon atoms in the molecule, alkylarenesulphonates and dialkylsulphosuccinates. 
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such as polyglycol ether derivatives of aliphatic and cycloaliphatic alcohols, saturated and 
unsaturated fatty acids and alkylphenols, said derivatives preferably containing 3 to 10 
glycol ether groups and 8 to 20 carbon atoms in the (aliphatic) hydrocarbon moiety and 6 
to 18 carbon atoms in the alkyl moiety of the alkylphenol. Further suitable non-ionic 
5 surfactants are water-soluble adducts of polyethylene oxide with poylypropylene glycol, 
ethylenediaminopolypropylene glycol containing 1 to 10 carbon atoms in the alkyl chain, 
which adducts contain 20 to 250 ethyleneglycol ether groups and/or 10 to 100 
propyleneglycol ether groups. Such compounds usually contain from I to 5 ethyleneglycol 
units per propyleneglycol unit. Representative examples of non-ionic surfactants are 

10 nonylphenol -polyethoxyethanol, castor oil polyglycohc ethers, polypropylene/ 
polyethylene oxide adducts, tributylphenoxypolyethoxyethanol, polyethyleneglycol and 
octylphenoxypolyethoxyethanol. Fatty acid esters of polyethylene sorbitan (such as 
polyoxyethylene sorbitan trioleate), glycerol, sorbitan, sucrose and pentaerythritol are 
also suitable non-ionic surfactants. 

15 Suitable cationic sxirfactants include quaternary ammonium salts, preferably halides, 
having 4 hydrocarbon radicals optionally substituted with halo, phenyl, substituted phenyl 
or hydroxy; for instance quaternary ammonium salts containing as N-substituent at least 
one C8C22 alkyl radical (e.g. cetyl, lauryl, palmityl, myristyl, oleyl and the like) and, as 
further substituents, unsubstituted or halogenated lower alkyl, benzyl and/or hydroxy- 

20 lower alkyl radicals. 

A more detailed description of surface-active agents suitable for this purpose may be 
found for instance in "McCutcheon*s Detergents and Emulsifiers Annual" (MC PubUshing 
Crop., Ridgewood, New Jersey, 1981), "Tensid-Taschenbucw', 2 d ed. (Hanser Verlag, 
Vienna, 1981) and "Encyclopaedia of Surfactants, (Chemical Publishing Co., New York, 

25 1981). 

Additional ingredients may be included in order to control the duration of action of the 
active ingredient in the composition. Control release compositions may thus be achieved 
by selecting appropriate polymer carriers such as for example polyesters, polyamino 
acids, polyvinyl pyrrolidone, ethylene-vinyl acetate copolymers, methylcellulose, 
30 carboxymethylcellulose, protamine sulfate and the like. The rate of drag release and 
duration of action may also be controlled by incorporating the active ingredient into 
particles, e.g. microcapsules, of a polymeric substance such as hydrogels, polylactic acid, 
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hydroxymethylcellulose, polyniethyl methacrylate and the other above-described 
polymers. 

Such methods include colloid drug delivery systems like liposomes, microspheres, 
microemulsions, nanoparticles, nanocapsules and so on. Depending on the route of 
administration, the pharmaceutical composition may require protective coatings. 
Pharmaceutical forms suitable for injectionable use include sterile aqueous solutions or 
dispersions and sterile powders for the extemporaneous preparation thereof. Typical 
carriers for this purpose therefore include biocompatible aqueous buffers, ethanol, 
glycerol, propylene glycol, polyethylene glycol and the like and mixtures thereof. 

The therapeutic compositions may include an effective amount of the Integration- 
interacting protein (INEP) antagonist, and other active ingredients. Exemplary 
formulations are given below: 



Formulations 

Ligredient mg/ml 



Intravenous Formulation I 
cefotaxime 250.0 

Integration-interacting protein (DSflP) antagonist 10.0 
dextrose USP 45.0 
sodium bisulfite USP 3.2 
edetate disodium USP 0.1 
water for injection q.s.a.d. 1.0 ml 
Intravenous Formulation II 
ampicillin 250.0 

Integration-interacting protein (INIP) antagonist 10.0 
sodium bisulfite USP 3.2 
disodium edetate USP 0.1 
water for injection q.s.a.d. 1 .0 ml 
Intravenous Formulation III 

gentamicin (charged as sulfate) 40.0 
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Integration-interacting protein (INIP) antagonist 10,0 

sodium bisulfite USP 3.2 

disodium edetate USP 0.1 

water for injection q.s.a.d. 1.0 nd 
5 Intravenous Formulation IV 

Integration-interacting protein (INIP) antagonist 10.0 

dextrose USP 45.0 

sodium bisulfite USP 3.2 

edetate disodium USP 0.1 
10 water for injection q.s.a.d. 1 .0 nol 

Intravenous Formulation V 

Integration-interacting protein (INIP) antagonist 5.0 

sodium bisulfite USP 3.2 

disodium edetate USP 0.1 
15 water for injection q.s.a.d. 1.0 ml 



The minimal nuclear HIV-l IN complex is a homotetramer 

20 A serious obstacle in working with recombinant retroviral integrases is their poor 
solubility and propensity for aggregation. When produced in bact^a, HIV IN has to be 
extracted firom inclusion bodies; detergents are used to stabilize the enzyme in solution. 
All crystal structure and some in vitro multimerization studies have been done witii the 
soluble mutants. Furthermore, recent reports raised concern that the stoichiometry and 

25 enzymatic activities of recombinant IN can be affected by the enzyme preparation (Leh et 
al., 2000; Sinha et al., 2002). The goal of the research was to study protein complexes that 
HIV-1 IN forms witiiin nuclei of human cells. The bulk of HIV-1 IN present in 293T cells 
that stably produce this viral protein, is associated with the insoluble nuclear fraction. 
Although IN seems to be directiy or indirectiy bound to chromosomal DNA, this may not 

30 be the only factor in nuclear retention of IN, since digestion of the detergent- 
permeabilized nuclei with nucleases was not sufficient to elute IN. This study was 
directed on IN complexes extracted from the detergent-permeabilized nuclei in hypertonic 
conditions. The present invention relates to the existence of the salt-eluted IN as part of a 
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distinct 61 A complex, which is not stable in diluted nuclear extracts and dissociates, 
releasing a 41 A core molecule. According to the cross-linking and gel filtration data, the 
present invention also relates to the 41 A molecule being a tetramer of IN, Indeed, the 
complex exists in immunoprecipitated INf preparations. Its molecular weight detennined 
both by gel filtration and cross-linMng (115-150 kDa) is close to the value expected for 
the presumed tetramer (128 kDa). Finally, partial cross-Mnking of the diluted nuclear 
extracts and of ttie purified tagged IN protein produced bands at the positions 
corresponding to the IN monomer, dimer and trimer, in addition to pi 50^^ (data not shown 
and Figure 2A, lane 7). It was estimated that the concentrations of IN and INf in the 
nuclear extracts did not exceed 10 nM (<0,3|xg/ml IN at lOO^ig/ml total protein). 
Therefore, the IN tetramer was stable even at subnanomolar concentrations (i.e. in the 
extracts diluted to 4 |j,g/ml of total protein), showing that the minimal nuclear IN complex 
is a tetramer. 

Composition of the 61 A HIV-] IN complex 

The native 61 A IN complex was found to decay during prolonged incubations and not to 
be stable during purification. When the immunoprecipitation time was shortened and 
higher initial extract concentrations were used, a cellular protein with an apparent MW of 
76 kDa specifically co-precipitated with the epitope-tagged IN. The protein, or Inip76, as 
we refer to it, was found to be identical to human LEDGF;DFS70/p75 protein. It was 
fiirther confirmed that Imp76/LEDGF is a part of the native 61 A IN complex, since 
addition of a monoclonal anti-LEDGF antibody shifted elution of the complex fi-om a gel 
filtration column. Importantly, when LEDGF protein was present, the p300''' band became 
visible after DTSSP cross-linking of the immunoprecipitated INf. This band could not be 
reproduced in LEDGF-negative INf samples obtained foUoAving the long 
immunoprecipitation protocol. P300^^ seems to be the result of partial cross-linking of the 
61 A IN complex for the following three reasons: (i) the MW was lower than the expected 
value for a 61 A globular complex («400 kDa); (ii) DTSSP cross-linking products with 
apparent MWs higher than that of p300^" were visible on anti-IN immunoblots; (Hi) this 
product did not react with anti-LEDGF antibody. Based on its apparent molecular weight, 
we speculate that p300*'^ probably represents two cross-linked IN tetramers. 
The exact stoichiometry of the IN-Inip76 complex was investigated. At the concentrations 
it is present in nuclear extracts, the 61 A complex was not stable enough to allow 
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measurement of its sedimentation coefficient, which is required to detennine its precise 
MW (Siegel and Monty, 1965). Assuming that the 61 A complex is globular, it can be 
estimated that its MW is around 400 kDa. The simplest model compatible with this MW 
is of a symmetrical complex containing a pair of IN tetrameis and two subunits of fiiip76, 
5 which corresponds to a macromolecule of 370 kDa. In a particular embodiment of the 
invention, the native 61 A complex contains another cellular protein. 

Retroviral IN within PIC: a dimer of dimers or a dimer oftetramers? 

During reverse transcription, the two retroviral cDNA termini are not completed 

10 simultaneously and both seem to be substrates for the 3'-processing activity of IN as soon 
as they appear (Miller et al., 1997). Moreover, at least in case of HIV, 3 '-processing of 
one LTR end was observed when the second end was non-functional and also did not 
support normal intasome assembly (Chen and Engelman, 2001). On the other hand, two 
functional LTRs were found to be required for strand transfer activity of isolated HIV 

15 PICs. Therefore, although LTRs can be processed asymmetrically, a synaptic complex 
involving both LTRs must form to aUow strand transfer, ensuring that only legitimate 
integration of both retroviral cDNA ends occurs. Based on a comparison to Mu phage 
transposase and available crystal structure data, it has been suggested that the active form 
of retroviral IN is a tetramer (a dimer of dimers) (Wang et al., 2001). The results of fliis 

20 research show that HIV IN expressed in human cells is indeed present as a stable 
tetramen Recombinant HIV-1 IN as well has been recently reported to form stable 
tetramers in diluted solutions (Leh et al,, 2000). Intriguingly, both Mu phage and Tn5 
transposases form active multimers (tetramers and dimer respectively) only within their 
synaptic complexes. Thus, independent transposase subumts must first bind to the ends of 

25 the transposon genome, before being brought together to forai the synaptic complex. 
Extrapolating this scheme to the retroviral PIC assembly, the stable IN tetramers can be 
looked at as such independent subimits, which first have to bind to one LTR end each 
before interacting with each other. According to this model, each individual tetramer 
would be capable of carrying out 3 '-processing reaction, whereas a dimer of tetramers 

30 would be necessary to accomplish the strand transfer. The data presented here show that 
the cellular HIV-1 IN 61 A complex contains two IN tetram^ that are close enough to be 
cross-linked to each other. This dimer oftetramers could represent the configuration of IN 
required for concerted integration. 
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What is the role ofInip76 in retroviral replication? 

The human Ihil/hSNF5/BAF47 protein has been shown to interact with HIV-1 IN in a 
yeast two hybrid screen and in vitro. Neither co-immnnoprecipitation of Ml with IN 
from nuclear extracts was observed; nor the co-localization of the two proteins in the cell 
was detected. Therefore, it can be stipulated that the cellular IN-Inil interaction might be 
transient and only occurring during certain steps of viral repUcation. The present 
invention shows that IN complexes extracted from the cell hnes stably expressing HIV-1 
IN contained another hxmian protein, Inip76, identical to LEDGF/DFS70/p75. INf and 
Inip76 co-locaUzed in the nuclei of 293T-IISPalaFLAG cells, confirming that the observed 
interaction was not an artifact of the extraction. So far, there has been no previous account 
of a link between LEDGF/DFS70/p75 and retroviral repUcation. No obvious relation 
seems to exist between this protein and known chromatin remodeling factors. 
Based on sequence similarity, Imp76/LEDGF/DFS70/p75 is a member of the hepatoma- 
derived growth factor (HDGF) family that includes HDGF and several other HDGF- 
related proteins (HRPs) (reviewed in (Dietz et al., 2002)). High degree of homology 
exists between the amino terminal regions of these proteins. The PWWP motif (70 
residues containing the Pro-Trp-Trp-Pro core sequence) is located within the amino 
terminal homology region of HRPs and relates them to a larger and ftmctionally diverse 
nuclear protein family, that includes DNA-binding transcription factors and enzymes 
involved in DNA repair and DNA melhylation. PWWP domains are thought to be 
implicated in protein-protein interactions (Stec et al., 2000), 

An eye-catching feature of Inip76 is the abundance of the charged amino acids 
comprising approximately 40% of the total residues. Screening its sequence against the 
BLOKS+ protein motif database (Henikoff et al., 1999) revealed fragments with 
similarity to the HMG-I(Y) DNA AT hook sequence (data not shown). LEDGF has been 
shown to be a DNA-binding protein with affinity for heat shock and stress-related DNA 
elemmts (HSE and STREs) (Singh et al., 2001). It has also been reported to interact with 
components of the general transcription machinery and witti the transcription activation 
domain of VP16 (Ge et al., 1998). Recent reports suggested that LEDGF plays an 
important role in regulating expression of the stress-response genes (Fatma et al., 2001; 
Shinohara et al., 2002). As mentioned above, altemative spUcing of LEDGF pre-mRNA, 
allows expression of the second protein, p52, from the same gene. A growing body of 
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evidence suggests that p75 and p52 may have differait functioiis. Although they both can 
interact vdth PC4, VP 16 and general transcription factors, at least in vitro, p52 displays 
higher transcription activation activity (Ge et al., 1998). In addition, p52 and not p75 has 
been shown to functionally interact with the ASF/SF2 splicing factor in vitro (Ge et al., 
5 1998). The proteins also differ in their nuclear distribution patterns (Nishizawa et al., 
2001). Intracellular levels of p52 appear to be much lower than those of p75, at least in 
the cell lines we have tested (HEK-293, 293T, HeLa, GEM) (data not shown). We have 
not detected co-immunoprecipitation of p52 with IN firom the nuclear cell lysates of IN- 
expressing 293T cells. However, it remains to be determined whether p52 is able to 
10 interact with mV IN. 

What is the role of the Inip76-IN interaction in the retroviral life cycle? 

The fact that recombinant Inip76 was able to stimulate HIV-1 IN activity in vitro shows a 

direct involvement of Imp76 in the integration process. As a chromosome-associated IN- 

15 binding protein, Imp76 could serve as a docking factor for the PIC, It might thus be 
functionally similar to the Mu phage transposition co-factor MuB that associating wdth 
tiie acceptor DNA makes it a preferred target for transposition. It is also conceivable that 
Inip76 migiht be a part of the retroviral PIC. The experiments show that ]hip76 is an 
essential factor in the HIV-repUcation and therefore Inip76 constitutes a novel target for 

20 anti-retroviral therapy directed against its interaction with DSf. Alternatively, a therapeutic 
strategy based on modified Inip76 protein, designed to capture the viral IN in a 
catalytically quiescent complex, is also part of the invention. 

25 Examples 

Methods 

Example 1 Recombinant DNA 




30 The HIV-1 integrase expression constmcts were based on the episomal pCEP4 vector 
(Invitrogen, Groningen, The Netherlands). The plasmid pCEP-DSf^ala is identical to the 
published pCBP-IN^ plasmid (Cherepanov et al,, 2000) with the difference that the Gly 
codon in the second position of the synthetic open reading frame (ORF) was mutated to 
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Ala. As a result, the construct expressed native HIV-1 IN with an addition of Met- Ala 
dipeptide at the N-terminus. To create the FLAG epitope-tagged IN expression construct 
pCEP-DSf^alaFLAG, the IN synthetic ORF from pCEP-lN^ala was ampUfied in two 
consecutive steps with the sense primer 5'-GGCTAGATATCACTAGCAAC 
5 CTCAAACAG plus two anti-sense primers 5'-GTCGTCCTTGTAATCGCCGTCC 
TCATCTTGACGAGAG and S'-GGCGCTCGAGTTACTTGTCATCGTCGTCCTTGT 
AATCGC, the resulting PGR fragment was digested with Xhol and cloned between the 
EcoRI (blunt) and Xhol sites of the pCEP4 vector. This plasmid expressed HIV-1 IN 
carrying the C-terminal FLAG epitope (DYKDDDDK). The plasmid pRP1012, for 

10 bacterial expression of His6-tagged HIV-1 IN, was a gift of Dr. R. Plasterk (The 
Netherlands). To obtain pCP6H75, the plasmid used for bacterial expression of His6- 
tagged Inip76/LEDGF, the PGR fragment amplified from a HeLa cDNA with the primers 
5'-GGCCGGATCCGACTCGCGATTTCAAACCTGGAGAC and 5'-CCGCGAATTCT 
AGTTATCTAGTGTAGAATCCTTC was digested with BamHI and EcoRI and cloned 

15 between the BamHI and EcoRI sites of pRSETB (Invitrogen). To prepare the mini-HTV 
IN substrate, the plasmid pU3U5 (Cherepanov et al., 1999) was digested with Seal. 

Example 2: Cell Culture 

The human embryonic kidney cells expressing SV40 large T antigen, 293T were obtained 
20 from Dr. O. Danos (Evry, France). Cells were grown in Dulbecco's modified Eagle's 
medixmi (DMEM) (Invitrogen) supplemoited with 10% fetal calf serum (FCS), 2 mM 
glutamine and 20 |ag/ml gentamicin at 3TC in 5% CO2 humidified atmosphere. To 
establish stable cell lines, 293T cells were transfected by electroporation with integrase 
expression constructs and selected with 200 |ag/ml of hygromycin B (Invitrogen). The 
25 selected cell lines, 293T-IN^ala and 293T-IN^alaFLAG, expressed IN at the levels similar 
to the previously reported 293T-IN^ cells (Cher^anov et al., 2000) as determined by 
western blotting and immunofluorescence. IN expression was stable for over 40 passages. 
For radioactive immunoprecipitation experiments cells were labeled in 
methionine/cysteine-free DMEM (Invitrogen) supplemented with 10% dialyzed PCS plus 
30 O.lmCi/ml of TRAN^^S-LABEL (ICN Biomedicals, Asse-Relegem, Belgium) for 
24 hours. 
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Example 3: Preparation of Nuclear Extracts 

293T-l]>rala or 293T-IN^alaFLAG cells grown to a confluency of 80-90% were 
dissociated from plastic dishes using trypsin, washed with PBS and re-suspended in the 
5 modified cytoskeleton buffer (10 mM Pipes [pH 6,8], 10% (w/v) sucrose, 1 mM DTT, 
1 mM MgCl2 plus the EDTA-free protease inhibitor cocktail (Roche, Brussels, Belgium)) 
(Fujita et al., 1997) containing 100 mM NaCl (referred to as lOOCSK buffer). CeUs were 
lysed for 10 min on ice with 0.5% NP40; nuclei were pelleted and washed with lOOCSK. 
To extract IN, nuclei were re-suspended in the 400CSK buffer (same as lOOCSK, but 
10 containing 400 mM NaCl) and left on ice for 5 min, the chromatin was removed by 
centrifugation at 7,500 rpm for 2 min. The total protein content of the nuclear extracts 
was measured using the BCA protein assay (Pierce, Rockford, IL), with bovine serum 
albumin (BSA) as the standard. 

15 Example 4: Chemical Cross-linking 

The nuclear extracts were diluted using 400CSK buffer to adjust the total protein 
concentration to 100, 20 or 4 |xg/ml. DTSSP (Pierce, Brembodegem-Aalst, Belgium) was 
dissolved in water immediately prior to flie experiment and used at 2, 0.5 or 0.1 mM. 
Cross-linking reactions were allowed to proceed for 15 min at room temperature, were 

20 terminated by addition of Va volume of the 4 x SDS sample buffer (200 mM Tris HCl [pH 
6.8], 4% SDS and 40% (v/v) glycerol) and incubation at room temperature for 20 min. 

Example 5: Gel Filtration Chromatoeravhv 

Nuclear extracts and immxmopurified INf were fractionated on a Superdex 200 HR 10/30 
25 gel filtration column (Amersham-Pharmacia). The 400CSK buffer was used in all 
chromatography experiments. The column was operated at 0.6ml/min, 4*^C, and 
calibrated using low and high MW gel filtration standards from Amersham-Pharmacia 
(blue dextran; thyroglobulin, Rs/Mw 85 A/669 kDa; ferritin, 61 A/440 kDa; catalase, 
52.2 A/232 kDa; aldolase, 48.1 A/158 kDa; BSA, 35.5 A/67 kDa; chymotrypsinogen A, 
30 20.9 A/25 kDa). The sample volume was kept at 200 lul; fractions of 300 |ll1 were 
collected and analyzed by western blotting using polyclonal anti-IN antibodies. When 
necessary, gel filtration fractions were concentrated by precipitation with trichloroacetic 
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acid. Stokes radii (Rg) and approximate MWs of the IN complexes were determined from 
their experimental partition coefficients (Kav) as described (Siegel and Monty, 1965). 

Example 6: Western Blottine^ and Immunoprecivitation 
5 The gradient 4-12 and 4-20% Novex Tris-Glycine gels were purchased from Ihvitrogen. 
Proteins were transferred onto PVDF membrane (Bio-Rad, Nazareth, Belgium); detection 
was done with ECL+ (Amersham-Pharmacia, Roosendaal, The Netherlands). The rabbit 
polyclonal anti-HIV-1 IN were home made and used at a final dilution of 1:30.000 
(Cherepanov et aL, 2000). The anti-FLAG M2 monoclonal antibody was from Sigma- 

10 Aldrich, the monoclonal anti-DNA-PKcs Ab-4 cocktail from NeoMarkers (Fremont, CA) 
and the monoclonal anti-LEDGF p75/p52 from BD Biosciences (Erembodegem, 
Belgium). The affinity purified anti-hMCM3 polyclonal antibody was a gift from Dr. 
Knippers R. (Universty of Konstanz, Germany). A combination of prestained MW 
markers (New England Biolabs, Hertfordshire, UK) and Markl2 (Invitrogen) was used to 

15 estimate MWs of the cross-linking products and Inip76. DNAPKcs detected in a 293T 
nuclear lysate sample using the anti-DNAPKcs Ab4 antibody served as the 470 kDa 
marker in some western blots. 

In the initial inununoprecipitation experiments, 30 pi of protein G agarose 
(Roche) and 1-3 |ig of the anti-FLAG M2 antibody was added to the nuclear extracts 

20 prepared in 400CSK and diluted to obtain total protein concentration of 200 jag/ml; the 
suspension was strirred at 4^C overnight (12-18 hours). The agarose beads were washed 
once with 400CSK and 4 times with lOOCSK plus 0.1% NP40. Protein was eluted in 
400CSK buffer by addition of 200|Lig/ml FLAG peptide (Sigma-Aldrich) or in 
SDS PAGE sample buffer. To purifiy LEDGF-INf complexes, immunoprecipitation was 

25 carried out using undiluted nuclear extracts (700 M-g/ml total protein) for 3-5 hoxirs. To 
identify the Inip76 protein by N-terminal sequencing and mass spectrometry, the 
procedure was upscaled. 293T-IN*alaFLAG cells were grown to confluency on five 
500 cm^ dishes (VWR International, Leuven, Belgium), lysed with 0.5% NP40. IN 
complexes were extracted from the nuclear pellets into 13 ml of the 400CSK bviffer and 

30 incubated with 300 \x\ of protein G agarose beads and 40 |ig of the anti-FLAG M2 
antibody for 4.5 hours. The INf complexes were eluted in 700 ^il of the 400CSK buffer 
with 200 ^ig/ml FLAG peptide. 
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Example 7: Amino Terminal Seauencins and Mass Spectrometry 

TmTniinn puri fi ed INfImp76 complexes were precipitated with trichloroacetic acid and 
redissolved in the SDS PAGE sample buffer. Approximately 3 \xg of the Ihip76 protein, 
electroblotted onto a Sequi-Blot PVDF membrane (Bio-Rad) from an SDS PAGE gel, 

5 was subjected to Edman degradation on a pulsed liquid phase Procise491cLC protein 
sequencer (Applied Biosystems, Lenoik, Belgium). For mass spectrometry analysis the 
coomassie blue-stained band of Inip76 was cut from an SDS PAGE gel, destained in a 
200 mM ammonium bicarbonate/50% acetonitrile, air-dried and soaked in 8 }il trypsin 
solution (16 ng trypsin (Promega) in 50 roM ammonium bicarbonate) on ice for 20 min. 

10 Following overnight digestion at 37°C, the supematant was recovered, and the gel slice 
was extracted twice using 60% acetonitrile/0.1% formic acid. The extracts and the 
supernatant were pooled and dried in a Speedvac concentrator. The peptides were re- 
dissolved in 0.1% formic acid and analysed by on-line nanoflow high performance liquid 
chromatography tandem mass spectrometry (LC/MS/MS) on an UltiMate capillary LC 

15 system (LC-Packings, Amsterdam, The Netherlands) coupled to a Q-Tof mass 
spectrometer (Micromass, Manchester, UK) equipped witii an electrospray ionization 
source. Technical details of tins system are reported elsewhere (Devreese et al., J. 
Chromatography A, in press). All spectra were processed using the MassLynx and 
MaxEnt software delivered with the mass spectrometer, 

20 

Example 8: Indirect Immunofluorescence Microscopy 

293T-IN^alaFLAG cells grown in Lab-Tek 11 glass chamber slides (VWR International) 
were fixed with 4% formaldehyde in phosphate buffered saline (PBS) for 10 min and 
permeabilized in ice-cold methanol. The cells were further blocked in PBS supplemented 

25 vnth 20 mM ammonium chloride and 10% FCS and incubated with rabbit polyclonal anti- 
FLAG antibodies (diluted 1:10.000 in PBS/10% FCS) (Sigma-Aldrich) and monoclonal 
anti-LEDGF (1:300) or anti-DNA PKcs (1:300) followed by Alexa-555 anti-rabbit and 
Alexa-488 conjugated anti-mouse IgG antibodies (Molecular Probes, Leiden, The 
Netherlands). The nuclear DNA was labeled with 5 pM To-Pro3 iodide (Molecular 

30 Probes). Confocal laser scanning fluorescent microscopy and imaging was carried with an 
LSM510 system (Carl Zeiss, Zaventem, Belgium) using a 488 nm Ar ion laser with a 
505-530 nm band pass filter for Alexa-488, a 543 nm HeNe laser with a 565-615 nm filter 
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for Alexa-555 and a 633 nm HeNe laser with a low pass 650 nm filter for ToPro-3. All 
acquisitions were done in the multi-track mode. 

Example 9: Recombinant proteins 
5 The His6-tagged HIV-1 IN was produced fix>m the pRP1012 plasmid in the Endo I-free 
host Escherichia coli strain PCI (BL21CDE3), AendL4::Tc^ pLysS) (Cherepanov et al., 
1999). The protein was purified firom the soluble fraction by chromatography on Ni-NTA 
agarose (Qiagen, Hilden, Germany) and Heparin Sepharose (Amersham Pharmacia, 
Uppsala, Sweden) in the presence of 7.5 mM 3-[(3- 

10 cholaniidopropyl)dimethylanimonio]propanesulfonate (CHAPS) (Sigma- Aldrich). The 
His6-tag was removed by incubation of the purified protein with thrombin (Novagen). 
The His6-tagged Inip76 was expressed from the plasmid pCP6H75 in PCI by induction 
with 1 mM isopropylthiogalactopyranoside (IPTG) at 29*'C for 3 hours in LB medium. 
The soluble protein was then purified by batch adsorption to Ni-NTA agarose and 

IS chromatography on a 1 ml HiTrap Heparin Sepharose column (Amersham Phamacia). 
The protein was eluted from the Heparin Sepharose colunm using a linear NaCl gradient 
in 30 mM TrisHCl [pH 7.0]. Peak fractions collected at approximately 800 mM NaCl 
were pooled and concentrated using Centricon-30 (Millipore, Brassels, Belgiimi) to 
obtain the final protein concentration of 1 mg/ml. The protein was estimated to be at least 

20 95% pure by SDS PAGE. 
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Examvle 10 Disruption of the nuclear and chromosomal localization of HIV IN using 
small interferini^ RNA molecules specific to the Inip76 mRNA. 

RNA interference with synthetic short RNA duplexes has been used to knock-down 
5 expression of endogenous genes in mammalian cells (Elbashir et aL, 2002). We used 
short interfering RNA (siRNA) duplexes to study the effect of depletion of Inip76 in 
293T-INsalaFLAG cells on the distribution of HIV-1 IN 

i): siRNA Preparation 

10 AU oligonucleotides for siRNA preparation were chemically synthetized by 

Dhannacon (CO, USA) and Xeragon (MD, USA) and were composed of ribonucleotides 
(A,U,G and C) pliis a pair of 3 '-terminal desoxy-thymidines (dTdT). 

Small interfering RNA molecules (siRNA) were prepared by annealing the 
following pairs of oligonucleotides: 

15 i) CAGCCCUGUCCUUCAGAGA-dTdT plus UCUCUGAAGGACAGGGCUG-dTdT, 
to obtain 76A RNA; 

ii) AGACAGCAUGAGGAAGCGA-dTdT plus UCGCUUCCUCAUGCUGUCU-dTdT, 
to obtain 76B RNA; 

iii) CAGAUGCAUUGAGGCCUUG-dTdT plus CAAGGCCUCAAUGCAUCUG-dTdT, 
20 to obtain 76C RNA; 

iv) GCGCGCUUUGUAGGAUUCG-dTdT plus CGAAUCCUACAAAGCGCGC-dTdT, 
to obtain NC RNA. 

The 76A, 76B and 76C siRNA molecxxles are thus designed to be specific for the Inip76 
mRNA and contain an anti-sense strand that can hybridize to the target mRNA. The NC 
25 molecule is an RNA duplex which is not specific to Inip76 RNA nor to any other known 
human mRNA. 

a. Transfection of the 293T''IN'alaFLAG Cells with the siRNA Molecules 
293T-IN^alaFLAG cells expressing FLAG-epitope tagged HTV-l IN were seeded. 
30 into 8 well Nunc LabTekll chamber glass slides (VWR International) and were 
transfected at a confluency of about 25% with each siRNA using GeneSilencer, a 
liposomal transfection reagent purchased firom GTS (Gene Therapy Systems, CA, USA). 
Transfection was done according to the manufacturer's recommendations. Briefly, for 
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transfection of each well, 0.3 |ag of RNA and 1.75 of the GeneSilencer reagent were 
used to transfect each well of the slide. 

Expression and distribution of IN and Inip76 was assessed by indirect 
immunofluorescence. Two. days post transfection the cells were fixed with 4% 
5 formaldehyde in PBS for 5 minutes, followed by four washes in PBS and treated with ice- 
cold methanol for 5 minutes. Fixed and penneabilized cells were then incubated in 
blocking solution (10% fetal calf serum/10 mM ammonium chloride in PBS) for 30 
minutes at room temperature. The cells were incubated with mouse monoclonal anti- 
human LEDGF antibody (pxirchased from BD Bioscience, KY, USA) diluted 1:200 in 

10 blocking buffer or monoclonal anti-FLAG M2 (Sigma- Aldrich) diluted 1:200 for 1 hour. 
Non-boimd primary antibodies were washed in three changes of blocking buffer. Cells 
were stained for 1 hour with Alexa-488 conjugated goat anti-mouse antibody (Molecular 
Probes) diluted 1:300 in blocking buffer and washed with blocldng buffer. Chromosomal 
DNA was stained by addition of 5 jxM ToPro3 iodide (Molecular Probes) to the 

15 secondary antibody solution. 

Western blot analysis demonstrated liiat the expression levels of HIV-1 IN in the 
cells transfected with 76A siRNA were reduced dramatically (not shown), suggesting that 
Inip76 is necessary for stability of the IN protein in human cells, likely protecting IN 
20 from proteolysis. 



Example IL Recombinant Inip76-IN Complex Produced in Escherichia coli. 

25 As elaborated above, recombinant Imp76 is a potent stimulator of HIV IN activity 

in vitro. We wanted to reconstruct the Inip76-IN complex from recombinant proteins, as 
such complex can be potentially used to study interactions of the proteins on molecular 
level as well as an enzyme for DNA recombination in vitro and in vivo. Biochemical 
studies on HIV IN as well as other retroviral INs suffer from poor solubility of the 

30 proteins. We found that adding purified recombinant Imp76 to IN solutions had a 
solubilising effect on the IN protein. Thus, when both proteins were mixed at a 
concentration of 0.25 mg/ml in 150 mM NaCl/10 mM Hepes/1 mM MgCh/O.l % NP40. 
Approximately 50 % of the IN protein remained soluble after 45 minutes iacubation at 
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4 °C and centrifugation at 20000 g for 5 minutes. In contrast, only about 5 % of IN was 
recovered in soluble form when Inip76 was omitted from the mixture. This result 
indicates that tiie complex between IN and Inip76 is more soluble than free recombinant 
IN, and that such complex can be formed using recombinant proteins. A soluble form of 
5 retroviral integrase in complex with its chromosomal receptor migiht help to determine tiie 
structure of tihie active form of the retroviral IN. In addition, if active such complex could 
be used as an enzyme for in vitro DNA recombination. 

i. Construction of the Inip76 Expression Plasmid pCP-Nat75 

10 

The open reading frame of Imp76/LEDGF was PGR amplified from pCP6H75 using 
the primers 5'-TGACTCGCGATTTCAAACC and 5'-CCGCGAATTCTAGTTATC 
TAGTGTAGAATCCTTC. The resulting DNA fragment was digested with EcoRl and 
subcloned between Ndel (treated with T4 DNA polymerase to obtain blunt terminus) and 
15 EcoECL sites of the pRSETB vector (Invitrogen). The complete Inip76 open reading frame 
and the phage T7 promoter region of the resulting plasmid, pCP-Nat75, was sequaaced to 
confirm that no mutation occurred. The plasmid was transformed into the coli PCI 
strain (E. coli BL21(DE3), AendAwTc^, pLysS) (Cherepanov et al., 1999) by standard 
methods. The resulting strain, PC2LEDGF expressed Inip76 upon induction with IPTG. 

20 

ii. Construction of the IN Expression Plasmids pKB"IN6H and pKB-Al 5IN6H 

To constmct the plasmid for bacterial expression of HTV-l IN with C-tenninal Hise 
tag, we amplified the fiill IN open reading frame using PGR with the primers 5'-GCGCG 

25 TCGACATCCTCATCCTGTCTAC and 5'-AATACGACTCACTATAGGG from the 
pINSD plasmid (obtained from The NM AIDS Research and Reference Reagent 
Program, catalog #2820). The RCR fragment was digested with Ndel and Sail and 
subcloned between Ndel and Sail sites of pET-20b(+) (Novagen). The open reading frame 
of IN as well as the T7 promoter region of the resulting plasmid pKB-IN6H was 

30 sequenced to verify absence of mutations. When. the plasmid was transformed into PCI 
bacteria, it expressed HIV-1 C-terminally Hise-tagged IN. However, pKB-IN6H is not 
able to stably co-exist with pCP-Nat75, as both plasmids have the same type of 
replication origin. To obtain pKB-A16IN6H, an IN-expression plasmid compatible with 
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pCP-Nat75, we inserted the BglWDraUL fragment of pKB-IN6H between BamHl and 
Bsal sites of pACYC177 vector (Chang and Cohen, 1978) (available from New England 
Biolabs) (the DraUl and Bsal ends of the DNA fragments were treated with T4 DNA 
polymerase to obtain ligatable blunt termini). 

iii. Co-expression of Inip76 and His6-tagged HIV-1 IN in Escherichia coli 

The plasmid pKB-A15IN6H was transformed into the PC2LEDGF E. coli strain 
(BL21(DE3), 2lenA4::Tc^ pLysS, pCP-Nat75) by standard methods. After 
transformation, bacteria harboring the newly transformed plasmids were selected on LB 
agar plates containing 120 jxg/ml ampicillin, 40 fxg/ml kanamicin and 15 ^ig/ml 
tetracycline at 28 °C. This procedure resulted in introduction of the pKB-A15IN6H 
plasmid and knock-out of the incompatible pLysS plasmid. The resulting strain 
PC2LEDGF-C (BL21(DE3), zlendL4::Tc^pCP-Nat75, pKB-A15IN6H) was grown on LB 
agar plates and in liquid LB medium supplemented with 120 Kig/ml ampicillin, 40 |Lig/ml 
kanamicin and 15 jig/ml tetracycline at 28 °C. Expression of Inip76 and IN was induced 
by addition of 1 mM IPTG at 29 °C in LB medium. 

iv Purification of the Inip76-IN Complex Expressed in E, coli 

An overnight culture of PC2LEDGF-C (100 ml) was diluted with 4 1 of fresh LB 
medium supplemented with 120 |ag/ml ampicillin and 40 iiig/ml kanamicin and grown in a 
shaking incubator at 28 °C until its optical density (measured at 600 nm) reached 
approximately 0.8, then the culture was induced with 1 inM IPTG. After 3 hours 
expression at 29 ^C the bact^ia were collected by centrifugation and kept frozen 
overnight at -70 °C. Next day defrozen bacteria were re-suspended in 20 ml of cell 
breaking buffer (500 mM NaCl/30 mM Hepes/1 mM MgCVO.l mM PMSF) and lysed by 
two passages through a French press at a cell pressure of 18000 psi. The resulting 
bacterial lysate was cleared by centrifugation at 15000 g for 30 min supplemented with 25 
mM imidazol and mixed with 1.5 ml of Ni-NTA resin (Qiagen) equilibrated with the 
same buffer. The slurry was gently mixed for 30 minutes to allow His-tagged protein to 
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bind to the resin. The resin was then packed into a disposable gravity flow column and 
washed with 50 ml of 400 mM NaCl/30 mM Hepes/1 mM MgCl2/25 mM imidazol, 
pH7.2 foUowed by 12 ml of 400 mM NaCl/30 mM Hepes/1 mM MgCb/SSmM 
imidazol, pH7.2. The bound protein was then eluted in the elution buffer (400 mM 
5 NaCl/30 mM Hepes/1 mM MgCl2/200 mM imidazol, pH 7.2). Both IN and Inip76 eluted 
in the presence of 200 mM imidazol, as can be seen from the SDS-PAGE gel shown in 
FIG. 9A. In a control experiment, carried-out with the lysate of induced PC2LEDGF, 
Inip76 was detected only in the non-bound fraction, and no Inip76 was detected in the 
fractions eluted with 200 mM imidazol (FIG. 9B). Therefore, as was expected, in the 

10 absence of Hise-tagged IN, Inip76 does not bind to Ni-NTA confirming that all Inip76 
foxmd in the samples is indeed associated with DM. 

The fractions containing IN and Imp76 were pooled together and injected iato a 
1 ml HiTrap SP Sepharose column (Amersham-Phannacia) equiUbrated with 35 % 
chromatography buffer B in A (buffer A was 1 mM MgCVSO mM Hepes, pH 7.2, buffer 

15 B was 1 M NaCl/1 mM MgCVSO mM Hepes, pH 7.2). The column was washed with 
35 % B at 0.7 ml/min and the bound proteins were eluted in a 15 ml hnear gradient from 
35 % B to 90 % B. The host-derived contaminating proteins, including the 70 kDa band, 
as well as free IN protein were foimd in the flow-through fraction (FIG. 9C). The Inip76- 
IN complex eluted in approxunately 55 % B, corresponding to 550 mM NaCl (fractions 

20 16-22 in FIG. 9C). The fractions containing Inip76 and IN were pooled and concentrated 
by ultrafiltration using Centricon-30 (Millipore) to a final volume of approximately 70 [xl 
(FIG. lOA). The final protein concentration was determined to be 4.8 mg/ml using BCA 
kit (Pierce) and BioRad protein assay (BioRad) with bovine serum albumin as a standard. 
The protein was kept at 4 °C for a week without signs of aggregation. In contrast, free 

25 HIV IN was not soluble at these concentrations in buffers with physiological pH. 
Detergents such as CHAPS or NP40 were needed to keep the protein from aggregation 
even at protein concentrations below 1 mg/ml. 

Example 12: His Taslntesrase Pull-down Assay 
30 Binding of IN to LEDGF/p75 was assayed in 25 mM Tris-HCl, pH 7.4, 0.1% Nonidet P- 
40, 20 mM imidazole containing 100 or 400 noM NaCl, in the presence of 1 mM MgCli 
(binding buffer). 1 fig of recombinant Hisg-tagged HIV-1 IN, HIV-2 IN or HTLV-2 IN 
was incubated with 1-3 |xg of LEDGF/p75 in 200 fil of bmding buffer supplemented with 
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2 |ig of bovine serum albumin (BSA). Following a 30-min incubation at 4°C, the mixtures 
were centrifuged to remove aggregated protein. The samples were supplemented with 40 
|xl of Ni-NTA-agarose and stirred for an additional 30 min. The agarose beads were 
recovered by centrifugation and washed with 500 |il of binding buffer. Boimd proteins 
5 were eluted in 40 ixl of binding buffer supplemented with 200 roM inaidazole and 1% 
SDS and analyzed by SDS-PAGE electrophoresis followed by staining with Coomassie- 
R250. We obtained the pRPlOlB plasmid, coding for HIV-2 IN, from Dr. R. P. Plasterk, 
Nederlands Kankeiinstituut, Amsterdam, pHTLV2 was obtained from C. Jonsson, New 
Mexico State University, New Mexico (Wang et al, 2001). Both His-tagged proteins 
10 were purified using Ni^**>ITA and Heparin colums. 

Example 13: Stable knock-down ofLEDGF/v75 and relating experiments remrdiner HTV- 
1 replication 
Cell culture 

15 MOLT-4 cells, clone 8 (Kikukawa et al, 1986) were grown in a himiidified atmosphere 
with 5% CO2 at 3TC and maintained in RPMI 1640 medium supplemented with 10% 
heat-inactivated fetal calf serum, 2 mM L-glutamiae, 0.1% sodium bicarbonate and 20 
^g/ml of gentamycin. 

Transduction ofMOLT-A cells with a retroviral vector for siRNA delivery. 
20 Molt-4 cells were transduced with the retroviral vector MSCV/U6-p75. The MSCV/u6- 

cont was used as a control. Both vectors are described in Silver and Devroe, 2002. 

Briefly, two complementary oligonucleotides correspondiag to the positions 1420-1441 

(Stephane maybe you can indicate the positions of the different siRNA's on the Figure?) 

of the open reading frame of p75 (GenBank accession nxmiber AF063020) were ligated 
25 into the pMSCVpuro (Clontech) m order to create the MSCV/U6-p75 vector. For the 

construction of MSCV/U6-cont, an irrelevant hairpin construct was used. Forty-eight 

hours post-transduction, selection of transduced cells was started by addition of 

puromycin. The LEDGF/p75 knock-down was verified m the MSCV/U6-p75 transduced 

polyclonal MOLT-4 cells by Westem blotting. 
30 For westem blottmg we used mouse monoclonal antibodies (BD Biosciensces).and the 

ECL+ chemiluminescent detection system (Amersham Biosciences) with gout anti-mouse 

antibodies CDAKO, Demnark) and horseradish peroxidase. 
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HIV infection 

Approximately 10^ MOLT-4 cells stably transduced with MSCV/U6-p75 or with 
MSCV/u6-cont were pelleted and incubated with HIV-1(IIIb) for two hours at different 
virus dilutions. Cells were washed twice with RPMI and resuspended in 10 ml RPMI, 
5 containing puromycin at a concentration of 1 fig/ml. The cells were subcultured at 
comparable density every 2 to 3 days. Cell-free supematans was analysed for p24 
determination by ELISA (HIV-1 p24 Core Profile ELISA, duPO?T, Dreieich, Germany). 
Every 2 to 3 days, the cell cultures were monitored for the appearance of HIV-induced 
cytopathogenic effect. Uninfected but stably transduced cells were used as a negative 
10 control. 



Results 

15 Examvle 14: HIV-1 IN is present in the insoluble nuclear fi'-action 

The 293T-IN^ala cell line used in this work was similar to the previously reported 293T- 
IN^, except that it expresses HIV-1 IN with the Met-Ala dipeptide at its N-terminus 
(instead of Met-Gly) (Cherepanov et al., 2000). This change was introduced to prevent 
potential myristoylation of the protein (Boutin, 1997) and affected neither cell line 

20 stability nor IN expression levels. The integrase protein was nuclear in both cell lines as 
determined by indirect immunofluorescence microscopy (data not shown and Cherepanov 
et al., 2000). After centrifugation of the digitonin- or NP40-treated 293T-IN^ala cells, 
most of tiie IN protein was retained in the nuclear pellet (Figure lA). Since NP40 
permeabilizes both the plasma membrane and the nuclear envelope, the bulk of IN present 

25 in the cell is thus stably associated with insoluble nuclear structures. IN was extracted 
from the nuclei by increased salt concentrations; 350-500 mM NaCl was sufficient to 
elute the protein (Figure IB). Therefore, buffers supplemented with 400 mM NaCl were 
used for preparing nuclear salt extracts. 

Some chromatin-associated proteins can be eluted from detergent-permeabiUzed 

30 nuclei by gentle treatment with DNAse I or micrococcal nuclease (Fujita et al., 1997; 
Holthoff et al., 1998; Meller and Fisher, 1995). We observed no elution of IN when nuclei 
prepared from 293T-IN*ala cells were treated with DNase I (Figure IC), that completely 
digested nuclear DNA to fragments of less then 200 bp (data not shown). In accordance 
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with Fujita et al. (Fujita et al., 1997), MCM3, a chromosomal repUcation factor, was 
removed from the nuclei by gentle DNasel treatment (Figure IC). The same salt 
concentration was required to extract IN from the nuclease-digested nuclei (data not 
shown). However, subjection of the DNase-digested nuclei to low ionic strength 
5 conditions led to efficient elution of the protein (Figure ID). Non-digested nuclei did not 
release any detectable amount of IN in the hypotonic conditions (Figure ID). These 
results indicate that IN is associated both with chromosomal DNA and witti some other 
nuclear structure, which is destabilized in low ionic strength conditions. Similar results 
were obtained when 293T-IN*ala nuclei were exposed to micrococcal nuclease (data not 
10 shown). 

Example 15: Chemical Cross-linking of IN Complexes Present in Nuclear Extracts 

We used 3,3'-Dithiobis[sulfosuccinimidyl propionate] (DTSSP), an amine-specific N- 

hydroxysuccinimid ester with a 12 A spacer arm, to cross-link protein complexes present 

15 in the nuclear extracts of 293T-IN^ala cells. The nuclear proteins were extracted from the 
NP40-treated 293T-IN^ala nuclei using the cytoskeleton buffer containing 400 mM NaCl 
(400CSK). The total protein concentration was adjusted to 100, 20 and 4 jag/ml; the 
samples were incubated with DTSSP and separated in a non-reducing 4-12% SDS PAGE 
gel. IN-containing cross-linking adducts were detected by western blotting using 

20 polyclonal anti-IN antibodies. A typical result is shown in Figure 2A. In the non-cross- 
linked samples (lanes 2, 6 and 10) as well as in the samples cross-linked in the presence 
of SDS (lane 1) only IN monomer and a band corresponding to IN dimer (approximately 
60kDa) were apparent. Addition of 0.1-2 mM DTSSP yielded cross-linked complexes of 
60 kDa (ip60% 150-180 kDa (the pl50^' band, clearly visible in lanes 7, 8 and 12), 250- 

25 350 kDa (p300''\ lanes 7 and 8) and less resolved higher molecular weight (MW) species. 
Strikingly, detection of the cross-linked IN complexes with our polyclonal anti-IN 
antibodies was far more sensitive than detection of the non-cross-Unked IN (compare the 
lanes 2 and 4, for example). Probably, some strong conformational epitopes are better 
preserved within cross-linked DSf during SDS PAGE and western blotting. No unspecific 

30 bands were revealed in nuclear extracts from 293T cells before and after cross-linking 
with DTSSP confirming that all bands detected in the western blot correspond to IN- 
containing complexes (data not shown). The cross-linking of the 293T-IN*ala nuclear 
extract was clearly dependent on the concentration of both DTSSP and protein. Saturating 
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cross-linking of diluted protein samples followed by denaturing SDS PAGE has been 
used to determine MWs of native protein complexes (see (Corey et al., 1998) and 
references thorein). Cross-linking of the diluted nuclear extract (4 \xgfml protein) with 
2 mM DTSSP yielded a band of ^proximately 150 kDa (pl5(f\ lane 16), while in more 
5 concentrated extracts (1 00 Hg/ml) a diffuse band of 250-300 kDa was the most prominent 
(pSOO''', lane 8). Thus, there exist at least two different IN complexes: a large complex at 
highw protein concentrations and a smaller complex in the diluted extract. The plSO"' 
product seems to be the result of complete cross-linking since no significant change in 
cross-linking occurs when the concentration of DTSSP was increased from 0.5 to 2mM 

10 (compare lanes 15 and 16) and higher (not shown). Thus, pl50''* probably represents the 
IN complex present in the diluted nuclear extract. Moreover, this complex is a 
dissociation product and importantly, a component of the larger complex, since it 
speared as a partial cross-linking adduct in the reactions with more concentrated protein 
extracts at 0.5 mM DTSSP and it decreased at 2 mM DTSSP (con^are lanes 7 and 8). 

1 5 However, the pSOO*"' band is probably not the result of complete cross-linking of the larger 
complex, since a strong smear and some less resolved bands are present above pSOO"' on 
the western blot; aggregated material not able to enter 4% acrylamide gel is also evident 
(lanes 8 and 12). Some of the high MW adducts in the reactions with ICQ |ng/ml extracts 
may result from non-specific inter-molecular cross-linking of proteins. Cross-Unking of 

20 IN complexes in the nuclear extracts using oxidizing Cu^'*"-[l,10-phenanthroline]3 
complex (Cys-Cys cross-linker (Ji, 1983)) were also suggestive for the presence of a large 
protein complex that dissociated upon dilution releasing a molecule of approximately 
150 kDa (data not shown). 

25 Example 16: Apparent S tokes Radii of the two Nuclear Inte^ase Complexes 

To confirm the presence of both IN complexes and deduce their size we used gel 
filtration. Undiluted (550 ng/ml protein) and diluted (30 fig/ml) nuclear salt extracts fix>m 
293T-IN*aIa cells were run on a calibrated Superdex 200 coluirm and the IN elution was 
followed by immunoblotting the collected firactions (Figure 3A). We observed two 

30 distinct elution volumes corresponding to two different IN complexes. After 
chromatography of the undiluted extract, IN eluted symmetrically with a peak maximum 
in fractions 8 and 9 (elution volume, Ve = 1 1.2 ml; Stokes radius, Rs = 61 A, see Figure 
3B). However, IN eluted later and in a broader peak with a Ve of 14.1 ml (R^ = 41 A, 
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Figures 3A and 3B) when the sample was diluted prior to gel filtration. Assuming both 
complexes are globular, their MWs can be calculated to be 380 kDa and 115 kDa 
respectively (Figure 3C). The smaller dilution-resistant molecule (Rs = 41 A) most likely 
corresponds to the pi 50^^ cross-linked complex observed in the previous experiment, 
5 while partial cross-linking of the 61 A IN complex probably resulted in p300*^\ The 
effective dilution of the peak fractions on the column was 5- to 7- fold as calculated from 
calibration runs (data not shown). Thus, the sample concentrations used in our gel 
filtration experiments corresponded to those used in the cross-linking expieriments 
(100 vig/ml and 4 |ig/ml). 

10 

Example 1 7: Purification and Characterization of FLAG-Tassed IN Complexes 
To facilitate isolation of native IN complexes from cell extracts we modified the IN 
expression construct adding the FLAG epitope tag at the carboxy terminus of IN. The 
293T-IN^alaFLAG cell line, obtained by stable transfection of 293T cells with the tagged 
15 expression construct, was very similar to 293T-IN*ala in stability and levels of IN 
expression (not shown), FLAG-tagged IN (INf) localized predominantly in the nuclei in a 
diffuse patt^ and was associated with chromosomes during mitosis (see below) as has 
been previously reported for non-tagged HTV-l IN (Cherepanov et aL, 2000). INf could 
be extracted from the nuclei of 293T-IN*alaFLAG cells in the same conditions as for non- 
20 tagged IN. The cross-linking pattern of INf with DTSSP was very similar to that of non- 
tagged IN (Figure 2B). The two major cross-linking products of INf showed slightly 
slower migration in SDS PAGE gels than the original pi 50^^ and p300^\ which can be 
attributed to the negative charge of the FLAG tag and increased molecular weight of the 
tagged protein. For convenience, however, we refer to the INf cross-linking adducts as 
25 pi 50^^ and p300^\ The gel filtration profiles were as observed for the non-tagged IN 
extracted from 293T-IN^ala cells (see below and data not shown). 

In initial immunoprecipitation experiments, we incubated diluted nuclear extracts 
from metabolically labeled 293T-IN^alaFLAG cells with the anti-FLAG M2 antibody and 
protein G agarose overnight. The protein isolated in this way displayed a single specific 
30 band in SDS PAGE gels migrating at the expected position for the FLAG-tagged IN 
(33-5 kDa) (Figure 4A). Isoelectrofocusing of immunoprecipitated INf in denaturing pH 
gradients showed a major band close to the predicted pi (6.5), which reacted with anti-IN 
serum in immunoblot (data not shown). When the INf immunoprecipitated from a nuclear 
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extract of 293T-IN^alaFLAG cells was eluted from the anti-FLAG M2 antibody with 
synthetic FLAG peptide and incubated with DTSSP, the plSO*^^ cross-linking product was 
readily obtained (Figure 4B). When higher INf concentrations were used in cross-linking, 
the immunoreactive reaction products accumulated at the top of the gel, suggesting 
5 aggregation of the protein (data not shown). We were not able to find reaction conditions 
to reproduce the p300*^^ cross-linking product with DSFf preparations purified this way. 
Fractionation of pxuified INf on a Superdex column showed a peak with a Kav value very 
close to that of the 41 A complex (Figure 4C). The fact that the 41 A complex exists in 
the purified INf preparation and the apparent MW (115-150 kDa), based on cross-linking 

10 and gel filtration experiments, suggest that the 41 A molecule is a tetramer of IN. 
Apparently, the native 61 A IN complex was not stable enough to withstand 
immxmoprecipitation xmder these conditions. Next, we tried shorter immunoprecipitation 
times (3-5 hours) starting from more concentrated nuclear extracts. Although the overall 
yield of INf decreased, the immimoprecipitated samples were found to contain an 

15 additional protein. It had an apparent MW of approximately 76 kDa, as determined by 
SDS PAGE (Figure 5A) and was present at variable ratios to INf in different preps. This 
protein, here referred to as Inip76 (for 76 kDa IN interactor protein), is specifically 
associated with INf, since, it could not be immunoprecipitated from the parental 293T 
cells with the anti-FLAG antibody (compare lanes 2 and 3 in Figure 5A). When undiluted 

20 nuclear extract from 293T-IN^alaFLAG cells was inmiunoprecipitated with anti-FLAG 
antibody for 4 hoxjrs both INf and Iaip76 bands were readily detected (Figure 5B). 
Although extending immunoprecipitation to 16 hours improved INf recovery, the yields 
of Inip76 were greatly reduced (compare the lanes 1 and 2 in Figure 5B). Intriguingly, the 
p300^^ band, detected after DTSSP cross-linking of the nuclear salt extracts, was readily 

25 observed when the Inip76-containing INf preparations were cross-linked with DTSSP 
(Figure 6E), suggesting that Inip76 is part of a large IN complex present in the nuclear 
extracts. 

Example 18: Identification of the IniD76 Protein as LEDGF/DFS70/v75 
30 By upscaling immunoprecipitation we were able to isolate sufficient amoxmts of Inip76 
for characterization by Edman degradation and mass spectrometry (Figure 6A). The 
amino terminal sequence obtained firom Inip76 was XXRDFKPGD (the first two residues 
being not resolved due to background noise). Scanning the TrEMBL protein database for 
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human proteins carrying this sequence tag (Wilkins et al., 1996) resulted in four hits with 
accession numbers 095368, Q91IER6, Q9NZB, 075475, aU corresponding to the two 
alternative products of one gene: LEDGF/DFS70/p75 (lens epitheUum-derived growth 
factor, referred hereafter as LEDGF) and the p52 protein ((Ge et al., 1998; Singh et al., 
5 2000)). Although the actual MW of LEDGF is approximately 60 kDa, it has been 
reported to migrate as a 75 kDa band m SDS PAGE gels (Ge et al., 1998). On-line Uquid 
chromatography tandem mass spectrometry analysis (LCVMS/MS) of tryptic peptides 
obtained by in-gel digestion of Inip76 provided further evidence that Inip76 is indeed 
identical to LEDGF. Lens epitiieUum-derived growth factor (LEDGF), is a member of the 

10 epatoma-derived growth factor family, is found at low levels in many actively dividing 
and long Uved cells. Its gene yields tow proteins, LEDGF/p75 and p52, by alternative 
splicing (Ge, H., Si, Y., and Roeder, R. G. (1998) EMBO J. 17, 6723-6729 and 2). 
LEDGF belongs to a family of homologous proteins including hepatoma-derived growth 
factor (HDGF) (Nakamura, H, Izumoto, Y, Kambe, H, et al (1994) Molecular clinmg of 

15 complementary DNA for a novel hiraian hepatoma-derived growth factor: its homology 
with high mobility group-l protein J Biol Chem 269, 25143-25149) and HDGF-related 
protein-1 and -2. (Izumoto, Y, Kuroda, T, Harada, H, Kishimoto, T, Nakamura, H. (1997) 
Hq)atoma-derived growth factor belongs to a gene family in mice showing significant 
homology in the amino terminus Biochem Biophys Res Commun 238,26-32). 

20 Half of the predicted LEDGF tryptic peptides within the mass range of 1000-2500 Da 
could be identified in the sample and their MS/MS spectra readily matched LEDGF 
covering approximately 18% of its sequence (Figure 6B and Tablet). Moreover, Inip76 
strongly reacted widi a commercially available monoclonal anti-LEDGF antibody (see 
below and data not shown). Most of INf present in nuclear extracts could be 

25 immunoprecipitated with the anti-LEDGF antibody, whereas only about 10 % of LEDGF 
could be recovered witii the anti-FLAG antibody (Figure 6C), suggesting that LEDGF is 
present in an excess over INf in the extract. Non-tagged EST could also be efficiently 
precipitated with the anti-LEDGF antibody, and a fiaction of tiie LEDGF could be 
precipitated with polyclonal anti-IN antibody in sinailar conditions &om nuclear salt 

30 extracts prepared from 293T-IN*ala cells. 
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Example 19: Inw76 is part of the 61 A HIV IN comvlex 

To determine whether the 61 A complex contains Inip76/LEDGF, we pre-incubated the 
nuclear salt extract from 293T-IN®aIaFLAG cells with a monoclonal anti-LEDGF 
5 antibody prior to chromatography on a Superdex column. The INf elution profile changed 
dramatically: the peak eluted now near the void volume of the column (Figure 6D). 
Elution of the 61 A complex was not altered by pre-incubation of the extract with an 
unrelated mouse IgGl (Figure 6D). Predictably, elution of the 41 A IN complex (the 
presumed IN tetramer) did not change after pre-incubation of the diluted nuclear extracts 

1 0 with the anti-LEDGF antibody. 

When INf Inip76 complex was purified by inmiunoprecipitation and cross-linked 
with DTSSP, the p300^^ band could be readily detected in an immunoblot with anti-EST 
antibody (Figure 6E). However, pSOO''^ did not react with a monoclonal anti-LEDGF 
antibody; instead a western blot with the anti-LEDGF antibody revealed two bands 

15 migrating at higher positions in the gel (pHMWi''^ and pHMW2''\ Figure 6E) (the MWs of 
these molecules are too high to be determined from SDS PAGE). These results suggest 
that p300^^ is a product of incomplete cross-linking of the 61 A IN-Inip76 complex. Being 
composed of IN alone, p300^' probably represents an octamer of IN (a dimer of 
tetramers). Contacts between Inip76 and IN within the 61 A complex may be less prone 

20 to cross-linking with DTSSP than those between IN protomers. We camiot, however, 
exclude the possibiUty that the target epitope for the monoclonal anti-LEDGF antibody 
used is masked or destroyed within pSOO*'^ Jn addition to the major p300^^ product, a band 
at a position close to pHMWi*'' is present on the anti-IN immunoblot of the purified and 
cross-linked INrInip76 complex (Figure 6E, lane 2). Thus, the pHMWi""^ adduct is 

25 probably the smallest cross-linked IN complex containing Inip76. 

Example 20: Inw76 co-localizes with IN within nuclei of293T'IhFalaFLAG cells 
In mammalian cells, LEDGF is expressed as a nuclear protein; it is distributed in a diffuse 
manner throughout the nucleus during interphase and is stably associated with condensed 
30 chromosomes throughout metaphase and anaphase (Nishizawa et al., 2001). A similar 
subcellular distribution has been reported for HIV-1 IN (Cherepanov et aL, 2000). 
Immunofluorescent detection of both INf and Inip76/LEDGF in fixed 293T-IN[^ala cells 
revealed strikingly similar intranuclear distribution patterns for both protems (Figure 7A). 
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In accordance with previous reports, both proteins were boiind to condensed 
chromosomes in mitotic cells (Figure 7B), The distribution of another nuclear protein, the 
catalytic subunit of DNA-dependent protein kinase (DNA PKcs) clearly differed from 
that of INf (Figure 7C). In addition, DNA PKcs was excluded from condensed 
5 chromosomes in mitotic cells (data not shown). Intriguingly, the nuclear localization of 
INf and Inip76 did not correspond to the overall DNA staining pattem, arguing against the 
possibihty that the apparent co-locaUzation of the two proteins might be merely due to 
their independent association with chromosomal DNA. In a control experiment, we 
visualized INf using a mixture of polyclonal and monoclonal anti-FLAG antibodies; the 
10 obtained two-color INf staining was similar to that of INf and Inip76 (data not shown). 

Example 21: Inip76 is an activator of HTV-l IN in vitro 

We have previously described activities of recombinant HIV-1 IN on the mini-HIV 
substrate, a linear 4.7 kb double stranded DNA molecule, carrying the U3 and U5 

15 terminal fragments of the viral LTR sequences (Cherepanov et al., 1999). Recombinant 
HIV-1 IN on itself was proficient in carrying-out cleavage and strand transfer using this 
long DNA substrate; however, presence of 5-12% polyethylene glycol (PEG) in the 
reaction was required for the enzymatic activity. To ascribe a possible function to the 
observed Iaip76-IN interaction, we examined whether Inip76 could modulate enzymatic 

20 activity of HIV-1 IN in vitro. The mini-HIV substrate was incubated with recombinant IN 
and His6-tagged Inip76 in the absence of PEG, and the reaction products were analyzed 
by native agarose gel electrophoresis (Figure 6F). In the absence of Imp76, strand transfer 
products were almost imdetectable (lane 3). Addition of Inip76 resulted in a potent 
stimulation of the reaction. In some conditions, approximately half of the substrate DNA 

25 was converted into various reaction products includiug those that were too large to enter 
the gel (lanes 7 and 8). Remarkably, both the overall efficiency of the reaction and the 
range of the strand transfer products depended on the concentration of Inip76. No 
significant variation in the yield of the strand transfer products was detected when the 
order of addition of Imp76 and IN to the mini-HIV reaction was reversed (data not 

30 shown). Althougji addition of Inip76 to IN reactions containing short oligonucleotide 
substrate stimulated the strand transfer activity, presence of PEG and DMSO was still 
essential for efficient Mg^'*'-dependent activity of our recombinant enzyme preparation 
(Caierepanov et al., 1997)). 
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Example 22: Disruption o f the nuclear and chromosomal localization of HIV IN usins 
small interferim^ RNA molecules specific to thelnip76 mRNA. 

5 Immimofluorescent detection of Inip76 and IN revealed that Inip76 expression 

was efficiently knocked-down in the majority of cells transfected with 76A, 76B and 76C 
molecules. Moreover, IN re-distributed from the nuclei to the cytoplasm of the transfected 
cells as illustrated in the FIG. 8B, Importantly, after knock-down of Inip76 expression, IN 
was not able to bind to chromosomal DNA, which was evident in all mitotic cells with 
10 condensed chromosomes (FIG. 8C). The distribution and expression of IN and Inip76 
remained unchanged in mock-transfected cells or cells transfected with NC RNA (FIG. 
8A). 

Example 23: Recombinant Inip76-IN Complex Produced in Escherichia coli: Activity of 
15 the Purified Recombinant IN'Inip76 Complex 

The purified complex was supplemented with the IN mini-HIV substrate in the 
conditions similar to the mini-HIV reactions described above. The final reactions 
contained 1.5, 0.75, 0.37, 0.18 and 0 ^ig of the IN-Itiip76 complex, 150 ng of the mini- 

20 mV substrate in the final volxmie of 20 The reactions also contained 110 mM NaCl, 
5 mM MgCl2, 5 mM DTT, 20 mM Hepes, pH 7.5. After 90 min incubation at 37 °C, the 
reactions mixtures were supplemented with 0.2 % SDS, 2 mM EDTA plus 5 |j,g of 
proteinase K and further incubated at 37 °C for 20 min, to disrupt all protein-DNA 
complexes. The reaction products separated in 0.8 % agarose gel were detected by 

25 staining wilii the SybrGold stain (Molecular Probes) (FIG. lOB). The activity of the 
complex was similar to the activity of IN in the presence of separately added Inip76 as 
shown above. 

Example 24: His Ta^ Intesrase Pull-down assay - Recombinant LEDGF/p75 forms a 
30 stable complex with HIV-l IN and HIV'2 IN. but not with HTLV'2 IN, 

We have previously shown that His-tagged HIV-1 IN bound to a Ni^'*'-chelating resm 
stably interacts with LEDGF/p75 in solution (Maertens et al., 2003). The specificity of 
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this assay was demonstrated by the lack of interaction with LEDGF/p52. To verify 
whether LEDGF interacts with all human retroviral integrases, we repeated this assay 
using His-tagged HIV-2 and HTLV-2 IN (Figure X). Interestingly, LEDGF/p75 
interacted with integrase of lentiviral HIV-2 but not with integrase of retroviral HTLV-2, 
5 We have previously shown that LEDGF/p75 increases the solubility of HIV-1 integrase; 
this is not flie case with HTLV-2 IN (data not shown). These experiments suggest that 
LEDGF/p75 is a lentiviral-specific co-factor. 

Example 25: Knock-down ofLEDGF/vJS with vector m ediated RNAi 

10 

Polyclonal MOLT-4-derived cell lines with stable knock-down of LEDGF/p75 were 
created using retroviral vectors that encode a small hairpin RNA targeting specifically 
LEDGF/p75 but not p52. Stable knock-down (up to 90%) of LEDGF/p75 expression was 
verified by western blotting (Figure 12A). A control MOLT-4 cell line stably expressing 
15 an unrelated hairpin was selected in parallel. Growth kinetics of both selected cell lines 
were comparable. 

The cell lines were infected with 10-fold dilutions of HIV-1(IIIb) and virus breakthrough 
was followed in time (Figure 12B). A clear delay of HIV replication was observed in the 
cell lines expressing reduced levels of LEDGF/p75 at all virus dilutions tested. 
20 Replication of HIV-1 added at an moi of 2.10"^ was even completely abolished. These 
data validate LEDGF/p75 as an antiviral target. 
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